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I. INTRODUCTION 

Zurek has introduced [l| envariance (environment- 
assisted invariance) in the following way. He imag- 
ined a system S entangled with a dynamically decou- 
pled environment £ altogether described by a bipar- 
tite state vector | ip)ss- Further, he imagined two 
opposite-subsystem unitary operators us and ug 
that "counter-transformed" each other when elevated 
to the composite system Us = {us 8 Is), Us = 
(Is <8> us), and applied to the bipartite state vector, e. 
g-, 

U £ U S \^)ss =W)ss- (1) 

Zurek remarked: "When the transformed property of 
the system can be so " untransformed" by acting only 
on the environment, it is not the property of 5." 
Zurek, further, paraphrases Bohr's famous dictum: "If 
the reader does not find envariance strange, he has not 
understood it." 

The first aim of this study is to acquire a full under- 
standing of envariance. The wish to understand envari- 
ance as much as possible is not motivated only by its 
strangeness, but also by the fact that Zurek makes use of 
it to derive one of the basic laws of quantum mechanics: 
Born's rule. His argument to this purpose gave rise to 
critical comments and inspired analogous attempts Q, 
1, 1, @. 

Since the term "Born's rule" is not widely used, the 
term "probability rule of quantum mechanics" will be 
utilized instead in this article. 

The probability rule in its general form states that if 
E is an event or property (mathematically a projector 
in the state space) of the system, and p is its state 
(mathematically a density operator), then the probability 
of the former in the latter is tr(Ep). (This form of the 
probability rule is called the "trace rule"). It is easy to 
see that an equivalent, and perhaps more practical, form 
of the probability rule is the following: If | cj>) is an 
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arbitrary state vector of the system, then {<f> \ p \ <j>) 
is the probability that in a suitable measurement on the 
system in the state p the event \4>)( ( t , \ will occur. This 
is what is meant by the probability rule in this article. 
(For a proof of the equivalence of the trace rule and the 
probability rule of this article see subsection V.E.) For 
brevity, we'll utilize the state vector | </>) instead of the 
event \(f>)(4>\ throughout. 

All derivations of Born's rule from envariance in the lit- 
erature are restricted to eigen-states ( p \ <f>) = r | <fi) , r 
a positive number). Four of the cited commentators of 
Zurek's argument (I have failed to get in touch with Fine) 
have pointed out to me that the restriction can be un- 
derstood as natural in the context of (previous) system- 
environment interaction, which has led to decoherence 
(see @, Sec. IIIE4), or if one takes the relative-state (or 
many- worlds) view, where the "observer" is so entangled 
with the system in the measurement that the restriction 
covers the general case (cf 0] and see the first quotation 
in subsection IV. A). 

It is the second and basic aim of this investigation to 
follow Zurek's argument in a general and precise form 
using the full power of envariance, and to complete the 
argument to obtain the probability rule, i. e., the formula 
(4> \ p \4>), beyond the approach in terms of the Schmidt 
decomposition (used in the literature). 

In the first subsection of the next section a precise 
and detailed presentation of the Schmidt decomposition 
and of its more specific forms, the canonical Schmidt 
decomposition, and the strong Schmidt decomposition 
is given. In this last, most specific form, the antiuni- 
tary correlation operator U a , the sole correlation en- 
tity inherent in a given bipartite state vector (introduced 
in previous work Q) is made use of. It is the entity 
that turns the Schmidt canonical decomposition into the 
strong Schmidt decomposition, which is complete and 
precise. This entity is lacking in almost all examples of 
the use of the Schmidt decomposition in the literature. 
(For an alternative approach to the correlation opera- 
tor via the antilinear operator representation of bipartite 
state vectors see section 2 in Q.) Twin unitaries, i. e., 
opposite-subsystem unitary operators that act equally on 
a given bipartite state vector, which are hence equivalent 
to envariance, are analysed in detail, and the group of all 
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pairs of them is derived. 

There is another derivation of the full set of envariance 
in the recent literature [lfj. It is algebraic, i. e., in terms 
of matrices and suitable numbers, whereas the approach 
of this study is geometrical, i. e., it is in terms of state 
space decompositions and suitable maps. 

In the second subsection of the next section connection 
between twin unitaries and twin Hermitians, i. e., so- 
called twin observables, studied in detail in pure bipartite 
states in previous articles [H, 11 1, is established. In the 



last subsection of the next section a possibility to extend 
the notion of twin unitaries to mixed bipartite states is 
shortly discussed. Extension to twin Hermitians in mixed 
states was accomplished in previous work fl2l |. 

The second and third subsections of section II are 
not necessary for reading section III, in which, following 
Zurek, a complete argument of obtaining the probability 
rule is presented with the help of the group of all pairs of 
twin unitaries and distance in the Hilbert space of linear 
Hilbert-Schmidt operators. 

In section IV., each of the four re-derivations of Born's 
rule from envariance, and Zurek's most mature Physical 
Review article on the subject, are glossed over and quo- 
tations from them are commented upon from the point 
of view of the version presented in section III. 

In concluding remarks of the last section the main 
points of this work are summed up and commented upon. 



II. MATHEMATICAL INTERLUDE: STRONG 
SCHMIDT DECOMPOSITION AND TWIN 
UNITARIES 

The main investigation is in the first subsection. 



and they depend only on |\l/)i2 and the corresponding 
basis vectors |m)i, and not on the entire basis. 

The generalized expansion coefficients are evaluated 
making use of the partial scalar product: 



Vm : 1 771)2 = ( m li|*) 



12- 



(26) 



The partial scalar product is evaluated expanding 
^ )i2 in arbitrary bases {| k }i : V/c} C 7i\, 
{| V)2 '■ ^1} C Ti.2, and by utilizing the ordinary scalar 
products in the composite and the factor spaces: 

l*>» = EE (< fc li CM*)") l^>i U> 2 - (2c) 

k I 

Then (2b) reads 
Vm : |m) 2 = ]T (V^ 1^ 0| 2 |*) 12 ) |1> 2j (2d) 



and the lhs is independent of the choice of the bases in 
the factor spaces. 

Proof is straightforward. 

Now we define a Schmidt decomposition. It is well 
known and much used in the literature. It is only a 
springboard for the theory presented in this section. 

If in the expansion (2a) besides the basis vectors | m) 1 
also the "expansion coefficients" | tti)^ are orthogonal, 
then one speaks of a Schmidt decomposition. It is usu- 
ally written in terms of normalized second-factor-space 
vectors {| 777)2 : Vm}: 



I*) 12 = ^a m \m)i \m) 2 , 



(3a) 



A. Pure-state twin unitaries 

We take a completely arbitrary bipartite state vector 
I v I / )i2 as given. It is an arbitrary normalized vector 
in Hi ® H2, where the factor spaces are finite- or 
infinite-dimensional complex separable Hilbert spaces. 
The statements 0X6, 0jS cL rule, asymmetric in the roles 
of the two factor spaces. But, as it is well known, for 
every general asymmetric statement, also its symmetric 
one, obtained by exchanging the roles of 1 and 2, 
is valid. We call an orthonormal complete basis simply 
"basis". 

The natural framework for the Schmidt decomposition 
is general expansion in a factor-space basis. 

Let { 1 777)1 ■ Vm} be an arbitrary basis in H±. Then 
there exists a unique expansion 



|*)i 2 =^|m) 1 |m) 2 , 



(2a) 



where the generalized expansion coefficients {| m)' 2 
Vm} are elements of the opposite factor space H.2, 



where a m are complex numbers, and Vm : |m)i and 
I 771)2 are referred to as partners in a pair of Schmidt 
states. 

The term "Schmidt decomposition" can be replaced 
by " Schmidt expansion" or " Schmidt form" . To be 
consistent and avoid confusion, we'll stick to the first 
term throughout. 

Expansion (2a) is a Schmidt decomposition if and only 
if the first-factor-space basis {|m)i : Vm} is an eigen- 
basis of the corresponding reduced density operator p\ , 
where 

p, =tr^( |*>ia<*|i2 ), s,s' = l,2, s^s', (4) 
and tr s is the partial trace over TL S . 

Next we define a more specific and more useful form 
of the Schmidt decomposition. It is called canonical 
Schmidt decomposition. 

The non-trivial phase factors of the non-zero coeffi- 
cients a m in (3a) can be absorbed either in the basis 
vectors in Tii in (3a) or in those in H.2 (or partly 
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the former and partly the latter). If in a Schmidt de- 
composition (3a) all non-zero a m are non-negative real 
numbers, then we write instead of (3a), the following de- 
composition 

i*)i2=E r i /2 i^i^ ( 36 ) 

i 

and we confine the sum to non-zero terms (one is re- 
minded of this by the replacement of the index m by 
i in this notation). Relation (3b) is called a canonical 
Schmidt decomposition. (The term "canonical" reminds 
of the form of (3b), i. e., of Vi : r^ 2 > 0. ) 

Needless to say that every ^ ) 12 can be written as 
a canonical Schmidt decomposition. 

Each canonical Schmidt decomposition (3b) is accom- 
panied by the spectral forms of the reduced density oper- 
ators: 

ps = J~]n \i)s(i\s, s = l,2. (5a, b) 

i 

(The same eigenvalues r% appear both in (3b) and in 
(5a,b).) 

One should note that the topologically closed ranges 
H(ps), s = 1, 2 (subspaces) of the reduced density oper- 
ators p s , s = 1,2 are equally dimensional. The range- 
projectors are 

Q s = J2 I *)-(*!-. s = l,2. (5c, d) 

i 

The two reduced density operators have equal eigenvalues 
{ri : Vi} (including equal possible degeneracies). 

One has a canonical Schmidt decomposition (3b) if and 
only if the decomposition is bi-orthonormal and all ex- 
pansion coefficients are positive. 

Proof of these claims is straightforward. 

It is high time we introduce the sole entanglement en- 
tity inherent in any bipartite state vector, which is lack- 
ing from both forms of Schmidt decomposition discussed 
so far. It is an antiunitary map that takes the closed 
range TZ(p\) onto the symmetrical entity Tt(p2). (If 
the ranges are finite-dimensional, they are ipso facto 
closed, i. e., they are subspaces.) The map is called the 
correlation operator, and denoted by the symbol U a Q , 

If a canonical Schmidt decomposition (3b) is given, 
then the two orthonormal bases of equal power {| i)i : 
Vi} and {\t)2 '■ Vi} define an antiunitary, i. e., antilin- 
ear and unitary operator U a , the correlation operator 
- the sole correlation entity inherent in the given state 
vector | i2 : 

Mi : \i) 2 = (u a l*)i) 2 - (6a) 

The correlation operator U a , mapping TZ(pi) onto 
TZ(p2), is well defined by (6a) and by the additional 



requirements of antilinearity (complex conjugation of 
numbers, coefficients in a linear combination) and by 
continuity (if the bases are infinite). (Both these require- 
ments follow from that of antiunitarity.) Preservation of 
every scalar product up to complex conjugation, which, 
by definition, makes U a antiunitary, is easily seen to 
follow from (6a) and the requirements of antilinearity 
and continuity because U a takes an orthonormal basis 
into another orthonormal one. 

Though the canonical Schmidt decompositions (3b) are 
non-unique (even if p s , s = 1, 2 are non-degenerate in 
their positive eigenvalues, there is the non-uniqueness of 
the phase factors of | i )i ), the correlation operator 
U a is uniquely implied by a given bipartite state vector 

|*>12- 

This claim is proved in Appendix A. 

The uniqueness of U a when 1 3 / )i2 is given is a 
slight compensation for the trouble one has treating an 
antilinear operator. (Though the difficulty is more psy- 
chological than practical, because all that distinguishes 
an antiunitary operator from a unitary one is its antilin- 
earity - it complex-conjugates the numbers in any linear 
combination - and its property that it preserves the abso- 
lute value, but complex-conjugates every scalar product.) 
The full compensation comes from the usefulness of U a . 

Once the orthonormal bases {| i }i : Vi} and 
{| 1)2 : Vi} of a canonical Schmidt decomposition (3b) 
are given, one can write 

U a = Y,\ihK(i\i, (6b) 

i 

where K denotes complex conjugation. For instance, 

[4i0)i = 5>M^r i<> 2 . (6c) 



We finally introduce the most specific form of Schmidt 
decomposition. We call it a strong Schmidt decomposi- 
tion. 

If one rewrites (3b) in terms of the correlation operator 
by substituting (6a) in (3b), then it takes the form 

|*)l2=E^ /2 | i >l( f/ «l i )l) 2 - ^ 

i 

This is called a strong Schmidt decomposition. 

If a strong Schmidt decomposition (3c) is written 
down, then it can be viewed in two opposite ways: 

(i) as a given bipartite state vector | ^) yi defining its 
two inherent entities, the reduced density operator p\ 
in spectral form (cf (5a)) and the correlation operator 
U a (cf (6a)), both relevant for the entanglement in the 
state vector; and 

(ii) as a given pair (pi,U a ) ( U a mapping antiuni- 
tarily TZ(pi) onto some equally dimensional subspace 
of Ti,2 ) defining a bipartite state vector |4 , )i2- 
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The second view of the strong Schmidt decomposition 
allows a systematic generation or classification of all 
state vectors in Hi ® H 2 (cf (HI). 

One has 

P2 = U a piU~ 1 Q 2 , Pi = U~ 1 p 2 U a Qi (7a, b) 

(cf (6a) and (5a, b)). Thus, the reduced density operators 
are, essentially, " images" of each other via the correlation 
operator. (The term "essentially" points to the fact that 
the dimensions of the null spaces are independent of each 
other.) This property is called twin operators. 

When one takes into account the eigen-subspaces 
1Z(QI) of p s corresponding to (the common) distinct 
positive eigenvalues rj of p s , where Q J S projects 
onto the rj — eigen-subspace, s = 1,2, then one ob- 
tains a geometrical view of the entanglement in a given 
state I 12 in terms of the so-called correlated subsys- 
tem picture Q : 

© 

%)=^^ 5 ), a = l,2, (7c, d) 

3 

where " © " denotes an orthogonal sum of subspaces, 

Vj: K(Qi) = U a K(Q{), Tl(Qi) = U- 1 'R(Q 3 2 ), 

(7e,/) 

and, of course, 

K(p 2 ) = U a K( Pl ), n(p l ) = U- 1 n(p 2 ). (7g,h) 

In words, the correlation operator makes not only the 
ranges of the reduced density operators " images" of each 
other, but also the positive-eigenvalue eigen-subspaces. 
Equivalently, the correlation operator makes the eigen- 
decompositions of the ranges "images" of each other. 

One should note that all positive-eigenvalue eigen- 
subspaces TZ(Ql) are finite dimensional because 
y^ 4i Ti = 1 (a consequence of the normalization of 
"J 7 ) 12 ), and hence no positive-eigenvalue can have infi- 
nite degeneracy. 

The correlated subsystem picture of a given bipartite 
state vector is very useful in investigating remote influ- 
ences (as a way to understand physically the entangle- 
ment in the composite state) (see [ll|, and [9j). 

We will need the correlated subsystem picture of 
\^)i2 for the basic result of this section given below: 
the second theorem on twin unitaries. Namely, we now 
introduce this term for the pairs (Ux,U 2 ) following a 
long line of research on analogous Hermitian operators 
(see the last mentioned references and the next subsec- 
tion). 

If one has two opposite factor-space unitaries u\ and 
u 2 that, on defining U\ = (ux ® I2) and U 2 = (li ® 
u 2 ), act equally on the given composite state vector 

Ux |tf}i 2 = £/2 |*)l2, (8a) 



then one speaks of twin unitaries (unitary twin opera- 
tors). They give another, equivalent, view of envariance 
(see the Introduction), since, rewriting (8a) as 

^ 2 _1 ^l|*>12=|*>12, (8b) 

one can see that U^ 1 " untransforms" the action of 
Ui (cf(l)). 

It is easy to see that U\ | *)ia(* I12 U 1 1 = 
U 2 I *)i 2 (* 1 12 U2 1 is equivalent to 

Ui |*)i2 = e lX U 2 |*) 12 , (8c) 

where A eRi. This does not diminish the usefulness 
of definition (8a), because, if (8c) is valid for a pair 
(Ui,U 2 ), then one only has to replace these operators 
by (Ui,e lX U 2 ) , and the latter satisfy (8a). 

Henceforth, we will write U s both for u s , s = 1,2, 
and for (li ® u 2 ) or (m <g> I2) (cf (1)). 

First Theorem on twin unitaries. Opposite factor- 
space unitaries U\ and U 2 are twin unitaries if and 
only if the following two conditions are satisfied: 

(i) they are symmetry operators of the corresponding 
density operators: 

U sPs U- 1 = p s , 8=1,2, (8d,e) 

and 

(ii) they are the correlation-operator " images" of each 
other's inverse. Writing Qj: = l s — Q s , s = 1,2, this 
reads: 

U 2 = U a Ui 1 U- 1 Q 2 + U 2 Q^, (8/) 

Ux = U^U^UaQx + UxQi- (8g) 

(The second terms on the rhs of (8f) and (8g) mean 
that U s is arbitrary in the null space TZ(Q^) of 
Ps, 8 = 1,2. ) 

Proof. Necessity. 

Uxpx = Uxtr 2 ( |*)i 2 (*|i2 ) = 

tr 2 (ui |*)i 2 (*|i 2 ) =tr 2 ((C/ 2 |*>ia)(*|ia) = 

tr 2 ((|*)i2(*|i2)f/2) = tr 2 f |*}i 2 (*|i2 Ux) = pxUx- 

Symmetrically one derives (8e). 

Applying the definition of twin unitaries in the envari- 
ance form (8b) to | ^)x 2 , written as a strong Schmidt 
decomposition (3c), one obtains 

E^ /2 (^ i^K 1 ^ i»)i) 2 = E^ 1/2 H u « i^) 2 - 

i i 
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On account of the unitary property of Ui and C/ 2 ~ , 
the lhs is bi-orthonormal, hence also {U± \ i)i : Vi} is 
an eigen-basis of pi in TZ(pi) due to the necessary 
and sufficient condition for a Schmidt decomposition (see 
above (4)). Then, one can rewrite the lhs as the strong 
Schmidt decomposition with this basis. Thus, one ob- 
tains 

'£r 1 i ' 2 (u 1 \i) 1 )u^(u a \i) 1 ) 2 = 

i 

£r\ 1/2 (^ilOi)(tUM*>i) 2 . 

i 

Since the generalized expansion coefficients are unique, 
one concludes 

U^UaQ! = U a U x Qi 

(cf (5c)). One has U\ = U\Q\ + UiQi as a conse- 
quence of relation (8d), which has been proved already, 
and which implies commutation with all cigen-projectors 
Q{, and hence also with Qi~J2jQi (cf (7c)). There- 
fore, the obtained relation amounts to the same as (8g). 
The symmetrical argument establishes (8f). (Note that 
here one starts with the decomposition that is symmetri- 
cal to (3c), in which an eigen-sub-basis of p2 is chosen 
spanning TZ(p2), and U a is replaced by U~ . ) 

Sufficiency. Assuming validity of (8d), it immedi- 
ately follows that besides {| i)i : Vi} (cf (3c)) also 
{U\ | i)\ : Vi} is an eigen-sub-basis of p\ spanning 
TZ(pi). Hence, we can write a strong Schmidt decompo- 
sition as follows: 

I*)i2 = £(tfi |i>i)(tf«tfi K)i) 2 - 

i 

Substituting here (8g) in the second factors, 

i*)i 2 = E( c/ iN)i)K 1c/ «N)i) 2 

i 

ensues. In view of the strong Schmidt decomposition 
(3c), this amounts to | <J)i 2 = UiU? 1 | *)i 2 , i. e., 
(8b), which is equivalent to (8a), is obtained. □ 

It is straightforward to show (along the lines of the 
proof just presented) that the twin unitaries are also 
responsible for the non-uniqueness of strong (or of 
canonical) Schmidt decomposition. To put this more 
precisely, besides (3c) (besides (3b)) all other strong 
Schmidt decompositions (canonical Schmidt decompo- 
sitions) are obtained by replacing {\ i)i : Vi} in (3c) 
by {Ui | i)i : Vi}, where [Ui,pi] — (by replacing 

{|*>i |*> 2 :V*} in (3b) by { (ill \i)i) (lJf 1 \i) 2 ) : V^}, 

where [U Sl p s ] = 0, s = 1,2, and (8f) is satisfied). 

The set of all pairs of twin unitaries (Ui,U 2 ) 
is a group, if one defines the composition law by 



{UM) x (UuUz) = (U[U l7 U 2 U!,) (note the inverted 
order in H2 ), and taking the inverse turns out to be 
(Ui,U 2 y 1 = (^f 1 ,*^ -1 )- Tnis claim is proved in Ap- 
pendix B. 

Having in mind the subsystem picture (7a)- (7h) of 
I 12 , it is immediately seen that the first theorem on 
twin unitaries can be cast in the following equivalent 
form. 

Second Theorem on twin unitaries. The group of 
all twin unitaries (U\, U2) consists of all pairs of oppo- 
site factor-space unitaries that reduce in every positive- 
eigenvalue eigen-subspace 1Z(Q J S ), s — 1,2 (cf (7c, d)), 
and the reducees are connected by relations (8f,g) mu- 
tatis mutandis, or, equivalently, by (8f,g) in which Q s 
is replaced by Q{, s = 1,2, and this is valid simulta- 
neously for all j— components. 

In the language of formulae, we have all pairs of uni- 
taries (Ui,U~2) that can be written in the form 

U a = ]T UlQi + U s Qj, s = l,2, (9a, b) 

j 

Vj : UlQi = U^Uir'U-'Qi (9c) 

UlQi^U-^U^UaQi. (9d) 

Note that within each positive-eigenvalue subspace 
Ti(Qi) of p s , s = 1,2, all unitaries are encompassed 
(but not independently, cf (9c,d)). This will be impor- 
tant in the application in the next section. 

The next two (short) subsections round out the study 
of twin unitaries. The reader who is primarily interested 
in the argument leading to the probability rule is advised 
to skip them. 



B. Connection with twin Hermitians 

There is a notion closely connected with twin unitaries 
in a pure bipartite state: it is that of twin Hermitians (in 
that state). If a pair (Hi, H2) of opposite factor-space 
Hermitian operators commute with the corresponding re- 
duced density operators, and 

H 2 = U a HiU- 1 Q 2 +H 2 Qi, Hi = U- 1 H 2 U a Qi+HiQi 

(10a, 6) 

is valid then one speaks of twin Hermitian operators. 
(Relations (10a,b), in analogy with (8f,g), state that the 
reducees in the ranges of the reduced density operators 
are "images" of each other, and the reducees in the null 
spaces are completely arbitrary.) 

One should note that twin unitaries are, actually, de- 
fined analogously. To see this, one has to replace f/" 1 
by Uj in (8f,g), and H s by H\, s = 1,2, in 
(10a,b). 
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Twin Hermitians have important physical meaning 
(TH . @. But here we are only concerned with their con- 
nection with twin unitaries. 

If U s , s — 1 or s = 2 are symmetry operators of 
the corresponding reduced density operators, i. e., if they 
commute, then there exist Hermitian operators that also 
commute with the latter and 

U s = e iH >Q s + U s Qi, s = 1 or s = 2 (11a, b) 

is valid. And vice versa, if H s , s = 1 or s = 2 
are Hermitians that commute with the corresponding re- 
duced density operators, then there exist analogous uni- 
taries given by (lla,b). (The unitary and Hermitian re- 
ducees in the ranges determine each other in (lla,b), and 
the reducees in the null spaces are arbitrary.) 

The latter claim is obvious. But to see that also the 
former is valid, one should take into account that commu- 
tation with the corresponding reduced density operator 
implies reduction in each (finite dimensional) positive- 
eigenvalue eigen-subspace (cf (7c, d)). Then one can take 
the spectral form of each reducee of U s , and (11a, b) be- 
comes obvious (and the corresponding reducees of H s 
are unique if their eigenvalues are required to be, e. g., 
in the intervals [0, 2tt). ) 

The connection (lla,b), which goes in both directions, 
can be extended to twin operators. 

If (UijUz) are twin unitaries, then (11a, b) (with 
"or" replaced by "and") determine corresponding twin 
Hermitians, and vice versa, if (H\,H2) are twin 
Hermitians, then the same relations determine corre- 
sponding twin unitaries. 



C. Mixed states 

If P\2 is a mixed bipartite density operator, then we 
no longer have the correlation operator U a and the 
correlated subsystem picture (7a)-(7h). Nevertheless, in 
some cases twin Hermitians, defined by 

Hipi2 = H 2 pi2 (12a, b) 

have been found [3. (Their physical meaning was anal- 
ogous to that in the pure-state case.) It was shown that 
(12a,b) implied 

[H 8 ,p s ]=0, 8=1,2, (12c, d) 

where p s are again the reduced density operators. (Un- 
like in the case when p\2 is a pure state, in the mixed- 
state case the commutations (12c, d) are not sufficient for 
possessing a twin operator.) 

Relations (12c, d), in turn, again imply reduction of 
H s in every positive-eigenvalue eigen-subspace 1Z(Q° S ) 
of p s , s = 1,2, but now the dimensions of the corre- 
sponding, i. e., equal-j, eigen-subspaces are, unlike in 
(7c, d), completely independent of each other (but finite 



dimensional). In each of them, relations (11a, b) (with 
"and" instead of "or") hold true, and define twin uni- 
taries satisfying (8a) with p\2 instead of | v E , )i2- 

Thus, in some cases, the concept of envariance can be 
extended to mixed states. 



III. BORN'S RULE FROM TWIN UNITARIES 

The forthcoming argument is given in 5 stages; the first 
3 stages are an attempt to ti ghte n up and make more ex- 
plicit, Zurek's argument [H], [T^|. [la ]. (lr| by somewhat 
changing the approach, and utilizing the group of all pairs 
of twin unitaries (presented in the first subsection of the 
preceding section). The change that is introduced is, ac- 
tually, a generalization. Zurek's "environment", which, 
after the standard interaction with the system under con- 
sideration, establishes special, measurement-like correla- 
tions with it, is replaced. Instead, an entangled bipar- 
tite pure state I M/) 12 is taken, where subsystem 1 is 
the system under consideration, and 2 is some opposite 
subsystem with an infinite dimensional state space TL2 ■ 
We shall try to see to what extent and how the quantum 
probability rule follows from the quantum correlations, i. 
e., the entanglement in | 12 - 

The forth stage is new. It is meant to extend the ar- 
gument to states I <j))i which are not eigenvectors of 

the reduced density operator pi = tr 2 ^ | \I'} 12 1 12 

The fifth stage is also new. It extends the argument to 
isolated (not correlated) systems. 

Let I 'J) 12 be an arbitrary entangled bipartite state 
vector. We assume that subsystems 1 and 2 are not 
interacting. (They may have interacted in the past and 
thus have created the entanglement. But it also may have 
been created in some other way; e. g., by an external 
field as the spatial-spin entanglement in a Stern-Gerlach 
apparatus.) 

We want to obtain the probability rule in subsystem 
1. By this we assume that there exist probabilities, and 
we do not investigate why this is so; we only want to 
obtain their form. 

The FIRST STIPULATION is: (a) Though the given 
pure state | ) 12 determines all properties in the com- 
posite system, therefore also all those of subsystem 1 , 
the latter must be determined actually by the subsystem 
alone. This is, by (vague) definition, what is meant by 
local properties. 

(b) There exist local or subsystem probabilities of all 
elementary events | |i, | <j))\ £ Hi- (As it has 

been stated, we will write the event shortly as the state 
vector that determines it.) 

Since | , E r )i2 6 (Hi ® H%j, subsystem 1 is 

somehow connected with the state space TLi, but it 
is not immediately clear precisely how. Namely, since 
we start out without the probability rule, the reduced 
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density operator p\ = tr 2 y | v E')i2(* 1 12 J, though 

mathematically at our disposal, is yet devoid of physical 
meaning. We need a precise definition of what is local 
or what is the subsystem state. We will achieve this 
gradually, and thus pi will be gradually endowed with 
the standard physical meaning. 

The SECOND STIPULATION is that subsystem or 
local properties must not be changeable by remote action, 
i. e., by applying a second-subsystem unitary U2 to 
| \& ) 12 or any unitary U23 applied to the opposite 
subsystem with an ancilla (subsystem 3 ) . 

If this were not so, then there would be no sense in 
calling the properties at issue "local" and not "global" 
in the composite state. We are dealing with a definition 
of local or subsystem properties. By the first stipulation, 
the probability rule that we are endeavoring to obtain 
should be local. 

The most important part of the precise mathematical 
formulation of the second stipulation is in terms of twin 
unitaries (cf (8a)). No local unitary U\ that has a 
twin U2 must be able to change any local property. 

Stage one. We know from the First Theorem on twin 
unitaries that such local unitaries U\ are all those that 
commute with pi (cf (8d)) and no others. In this way 
the mathematical entity pi is already beginning to ob- 
tain some physical relevance for local properties. 

We know from the Second Theorem on twin unitaries 
that we are dealing with U\ that are orthogonal sums of 
arbitrary unitaries acting within the positive-eigenvalue 
eigen-subspaces of p\ (cf (9a)). 

Let |0}i and \4>)'i be any two distinct state vectors 
from one and the same positive-eigenvalue eigen-subspace 
1Z(Q{) of p\. Evidently, there exists a unitary U{ 
in this subspace that maps | (f>)i into | <j))\, and, 
adding to it orthogonally any other eigen-subspace uni- 
taries (cf (9a)), one obtains a unitary U\ in Hi that 
has a twin, i. e., the action of which can be given rise 
to from the remote second subsystem. ("Remote" here 
refers in a figurative way to lack of interaction. Or, to use 
Zurek's terms, 1 and 2 are assumed to be "dynam- 
ically decoupled" and "causally disconnected".) Thus, 
we conclude that the two first-subsystem states at issue 
must have the same probability. 

In other words, arguing ab contrario, if the probabil- 
ities of the two distinct states were distinct, then, by 
remote action (by applying the twin unitary U2 of the 
above unitary U\ to | 12 ) , one could transform one 
of the states into the other, which would locally mean 
changing the probability value without any local cause. 

Putting our conclusion differently, all eigen-vectors of 
pi that correspond to one and the same eigenvalue 
Tj > have one and the same probability in | \I/ } 12 - 
Let us denote by p(Q{) the probability of the, in gen- 
eral, composite event that is mathematically represented 
by the eigen-projector Q\ of p\ corresponding to 
Tj (cf (9a)), and let the multiplicity of rj (the di- 



mension of 7?.(Q{) ) be dj. Then the probability of 
I <f))i(4> |i is p(Q 1 1 )/dj. To see this, one takes a ba- 
sis {| <f>k)i ■ k = 1,2, ...,dj} spanning 1Z(Q\), or, 

equivalently, Q[ = J2k=i I <Pk)i{<Pk |i, with, e. g., 
\(j>k=i)i =\4>)\- Further, one makes use of the additivity 
rule of probability: probability of the sum of mutually ex- 
clusive (orthogonal) events (projectors) equals the same 
sum of the probabilities of the event terms in it. 

Actually, the er-additivity rule of probability is the 
THIRD STIPULATION. It requires that the probability 
of every finite or infinite sum of exclusive events be equal 
to the same sum of the probabilities of the event terms. 
We could not proceed without it (cf subsections V.E and 
V.F). The need for infinite sums will appear four passages 
below. 

In the special case, when p\ has only one positive 
eigenvalue of multitude d (the dimension of the range 
of pi ), the probability of \ (f>)i is p(Qi)/d (where 
Qi is the range projector of p\. ) To proceed, we need 
to evaluate p(Qi). 

To this purpose, we make the FOURTH STIPULA- 
TION: Every state vector \cf>)i that belongs to the nuO 
space of p\ (or, equivalently, when |(/>}i(0|i , acting on 
I "J/) 12, gives zero) has probability zero. (The twin uni- 
taries do not influence each other in the respective null 
spaces, cf (9a, b). Hence, this assumption is independent 
of the second stipulation.) 

Justification for the fourth stipulation lies in Zurek's 
original framework. Namely, if the opposite subsystem 
is the environment, which establishes measurement-like 
entanglement, then the Schmidt states, e. g., the above 
eigen-sub-basis, obtain partners in a Schmidt decompo- 
sition (cf (3a)), and this leads to measurement. States 
from the null space do not appear in this, and cannot 
give a positive measurement result. 

One has li = Qi + £j | Z)i(Z |i, where {| Z)i : VZ} 
is a basis spanning the null space of p\ , which may be 
infinite dimensional. Then, p(Qi) = p(li) = 1 follows 
from the third postulate ( cr-additivity) and the fourth 
one. Finally, in the above special case of only one positive 
eigenvalue of p\, the probability of | <j>)i £ Tl(pi) is 
1 Id, which equals the only eigenvalue of p\ in this case. 

Our next aim is to derive p(Q{) in a more general 
case. 

Stage two. In this stage we confine ourselves to com- 
posite state vectors \^}i2 (i) that have finite entangle- 
ment, i. e., the first-subsystem reduced density operator 
of which has a finite-dimensional range; (ii) such that 
each eigenvalue rj of p\ is a rational number. 

We rewrite the eigenvalues with an equal denomina- 
tor: Vj : rj = rrij/M. Since J2j^j r j = 1' onc nas 
djmj = M ( dj is the degeneracy or multiplicity 
of rj ). 

Now we assume that | 12 has a special structure: 
(i) The opposite subsystem 2 is bipartite in turn, 
hence we replace the notation 2 by (2 + 3), and 
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I*) 12 by |$}i23- 

(ii) a) We introduce a two-indices eigen-sub-basis of 
p\ spanning the closed range Tl(pi) ■ {\j,kj)\ : kj = 
1, 2, . . . , dj] Vj} so that the sub-basis is, as one says, 
adapted to the spectral decomposition p\ = J2j r jQi 
of the reduced density operator, i. e., Vj : Q{ — 
Ej=i \ j,kj)i{j,kj\i . 

b) We assume that H 2 is at least M dimen- 
sional, and we introduce a basis {| j,kj,lj) 2 : lj — 
1,2, ... ,mj\kj = 1,2,..., dj; Vj} spanning a subspace 
of H 2 . 

c) We assume that also H% is at least M dimen- 
sional, and we introduce a basis {| j,kj,lj) 3 : lj — 
1,2,..., rrij] kj = 1,2,..., dj; Vj} spanning a subspace 
of H 3 . 

d) Finally, we define via a canonical Schmidt decom- 
position 1 + (2 + 3) (cf (3b) and (5a)): 



123 



Y,Y,{mj/M)^\\j,kj) 



(^(l/mj^^kjjjh^kjjjh) 
\=i 



(13a) 



Equivalently, 



I $ )l23 E E(VM) 1 / 2 |j,fc,)l \j,kj,lj) 2 

j fcj = 1 = 1 

(136) 

Viewing (13b) as a state vector of a bipartite (1 + 
2) + 3 system, we see that it is a canonical Schmidt de- 
composition (cf (3b)). Having in mind (5a), and utilizing 
the final conclusion of stage one, we can state that the 
probability of each state vector j, kj)\ \ j,kj,lj) 2 is 
1/M. 

On the other hand, we can view (13a) as a state vec- 
tor of the bipartite system 1 + (2 + 3) in the form of 
a canonical Schmidt decomposition. One can see that 

Vj, (Q{®U) and E^iE^i I J,*i>iO\*j |i 
® | j, kj,lj)2{j, kj,lj | 2 act equally on | $}i 2 3. On the 
other hand, it is easily seen that the former projector can 
be written as a sum of the latter sum of projectors and 
of an orthogonal projector that acts as zero on | $)i23, 
and therefore has zero probability on account of stipu- 
lation four. Thus, (Q{ ® 1 2 ) and the above sum have 
equal probabilities, which is 



p(Q{ ® I2) = djirij/M. 



As it was concluded in Stage one, the probability of 
any state vector \<p)\ in TZ(Q{) is p(Q\)/dj. The 
projectors Q\ and (Q{ <g> I2) stand for the same 
event (viewed locally and more globally respectively), 
hence they have the same probability in | *&) 123 - Thus, 



p(\ (j>)i((j> |i) = rrij/M — rj, i. e., it equals the corre- 
sponding eigenvalue of p\. 

We see that also the eigenvalues, not just the eigen- 
subspaces, i. e., the entire operator pi is relevant for 
the local probability. At this stage we do not yet know 
if we are still lacking some entity or entities. We'll write 
A for the possible unknown. 

How do we justify replacing | ^)\ 2 by | $)i23? In 
the state space (H2 <§3 H3) there is a pair of orthonor- 
mal sub-bases of d = ^\ dj vectors that appear in 
(13a) (cf (15)). Evidently, there exists a unitary opera- 
tor 1/23 that maps the Schmidt-state partners | j, kj) 2 
of I j, kj)i in 1^)12 tensorically multiplied with an 
initial state | ^0)3 into the vectors: 



Vkj, Vj: U, 



23 



\j,kj) 2 \<j>oh 



E(iM) 1/2 \ j,kj,ijh \j,kj,ij) 3 . 



(15) 



On account of the second stipulation, any such U23, 
which transforms by interaction an ancilla (subsystem 
3 ) in state | ^0)3 and subsystem 2 as it is in |5')i2 
into the (2 + 3)-subsystem state as it is | <I>}i23, docs 
not change any local property of subsystem 1. Hence, 
it does not change the probabilities either. 

Stage three. We make the FIFTH STIPULATION: 
the sought for probability rule is continuous in pi, 
i. e., if pi = lim^oo,^, then p(E 1 ,p 1 ,X) = 
lim„^ oc p(i?i,p™, A), for every event (projector) E\. 
(We assume that A, if it exists, does not change in the 
convergence process.) 

Let pi — Ej=i r jQi> J a natural number, be the 
spectral form of an arbitrary density operator with finite- 
dimensional range. One can write p\ = linin^oo/0™, 
where p\ = Y^j=i r "0i> with r j = lhii^oo r™ , j = 
1,2. . . . , J, and all r™ are rational numbers. (Note 
that the eigen-projectors are assumed to be the same 
all over the convergence.) Then the required continu- 



ity gives for an cigcn-vcctor 
sponding to the eigenvalue rj 



rj a ) of pi corre- 
al r jo ),pi,X) = 
lim„^oop(| r JO }, p", A) = r J0 . This extends the con- 
clusion of stage two to all pi with finite- dimensional 
ranges, and their eigen- vectors. 



Let pi = 



j=l '3^1 



Q\ have an infinite-dimensional 



range. We define p\ = YTj=\ ( r j/(ELi r fc))<2i- 
(Note that we are taking the same eigen-projectors 
Q{. ) Then p\ — linin^oo p", and for any eigen- vector 



( 14 ) kio) one has p{\r jo ),pi, A) = \im n ^ ooP (\r jo ),p r t, A) 



Oohl 

limn^oo T"j /(Efc=i r fe) = r jo- This extends the conclu- 
sion of the preceding stage to all reduced density operators 
and their eigen-vectors. 

As a final remark about stage three, we point 
out that the continuity postulated is meant with 
respect to the so-called strong operator topology 
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in Hilbert space [l?}- Thus, if p = lim^oo/On, 
then, and only then, for every vector | ip ) one has 
p \ ip) = lira p„ This means, as well known, 

that lim n _Kx,||p | ip) — p n \ = (where the 

"distance" in the Hilbert space is made use of). 

Stage four. The result of the preceding stages can be 
put as follows: If pi |</>)i = r 1 0) 1, then the probability 



P(l0)l,Pl) = r = <0 1 1 Pi l< 



(We have dropped X because we already know that, 
as far as eigen-vectors of p\ are concerned, nothing is 
missing.) Now we wonder what about state vectors in 
H i that are not eigen-vectors of pi ? 

We make the SIXTH STIPULATION: Instead of pi, 
of which the given state \<t>)i is not an eigen-state, we 
take a different density operator p\ of which \4>)\ * s 
an eigenvector, i. e., for which p' x \ (\>) \ — r 1 \ (\>) \ is 
valid, and which is closest to p\ as such. We stipulate 
that the sought for probability is r' . (We expect that 
r' will be determined by the requirement of " closest as 
such".) 

The idea behind the stipulation is the fact that there 
exists non-demolition (or repeatable) measurement, in 
which the value (of the measured observable) that has 
been obtained is possessed by the system after the mea- 
surement, so that an immediate repetition of the same 
measurement necessarily gives the same result (it is not 
demolished; it can be repeated). There even exists so- 
called ideal measurement in which, if the system had a 
sharp value of the measured observable before the mea- 
surement, then it is not only this value, but the whole 
state that is not changed in the measurement. But in 
general, the state (the density operator) has to change, 
though minimally, in ideal measurement. The point is 
that in this change p — > p' the probability does not 
change (t/>\ p' \<f>) = p \ <j>). 

To make the requirement of "closest" more specific, we 
make use of a notion of " distance" in the set of density 
operators (acting in Tii ). As known, the set of all lin- 
ear Hilbert-Schmidt operators in a complex Hilbert space 
is, in turn, a complex Hilbert space itself (cf Appendix 
C). All density operators are Hilbert-Schmidt operators. 
Every Hilbert space is a distantial space, and "closest" is 
well defined in it. 

We are not going to solve the problem of finding the 
closest density operator to p\ because a related prob- 
lem has been solved in previous work of the author [l8[ . 
Namely, the fact that | <f))i is an eigenvector of p[ 
can be put in the equivalent form of a mixture 

p[=r' l^i^la + 

(1 -/•')[( |0>i<0|i ) Vi( 10)1^11)^/(1-/)]. (17) 

In (17) p[ is a mixture of two states, one in which 
|0)i(</>|i as an observable has the sharp value 1, and 
one in which it has the sharp value . 



In Ref. [18j it was shown that when a density operator 
pi is given, the closest density operator p' 1; among 
those that satisfy (17), is: 



MlP! |^)l |0>X<^h 



0)i<0|i Pi 



(16) Thus, 



/ = (0|iPi |< 



(18) 



(19) 



and the same formula (the last expression in (16)) ex- 
tends also to the case when \4>)i is not an eigenvector 
of pi. 

Incidentally, the requirement of closest p' to p un- 
der the restriction that the " closest" is taken among those 
density operators that are mixtures of states with sharp 
values of the measured observable A — J2k a kPk (spec- 
tral form) defines the Liiders state p' = ^ k PkpPk 
[HI]. (It was postulated [l9| : and as such it appears in 
textbooks [20|.) As well known, in ideal measurement 
p changes to the Liiders state. (In so-called selective 
ideal measurement, when one takes the subensemble cor- 
responding to a specific result, say, <Zfc , the change of 

state is p — > Pfc pPfc /tr(Pfc p). This is sometimes 

called "the projection postulate".) 

As a final remark on stage four, one should point out 
that "distance" in the Hilbert space of linear Hilbert- 
Schmidt operators also defines a topology, in particular 
a convergence of density operators. It is stronger than 
the so-called strong operator topology utilized in the 
preceding stage. More about this in Appendix C. 

Stage five. Finally, we have to find out what should 
be the probability rule when p is not an improper, but 
a proper mixture, i. e., when there are no correlations 
with another system. We take first an isolated pure state 
IV). 

We start with an infinite sequence of correlated bipar- 
tite state vectors {^12)™ : n = 1, 2, . . . , 00} such that, 
as far as the reduced density operator is concerned, one 
has 

Vn: p? = (l-l/n) |V)i(V|i + 



(lV)i(^|i)Vi l (lV)i(^l 



(20) 



where | ip)i actually equals | tp). (It is well known 
that for every density operator pi there exists a state 

vector |*) 12 such that pi = tr 2 M *) 12 (* 1 12 V This 

claim is easily proved using the spectral form (5a) of pi 
and the canonical Schmidt decomposition (3b).) We now 
write index 1 because we now do have correlations with 
subsystem 2. 
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Obviously 

|V>i(V>|i= Hm Pf- (21) 

n — >oo 

According to our fifth stipulation, the probability rule 
is continuous in the density operator. Hence, 

V |0) : p( = n lim p( |0>i,P?) = 

lira (0|i 1 0>! = (0|i lim |0> x . 
This finally gives 

V |0) : p( |0), - (01 ( WW ) 10) - K0IIV')! 2 . 

(22) 

In this way, the same probability rule is extended to iso- 
lated pure states. 

If p is an isolated mixed state, i. e., a proper mixture, 
one can take any of its (infinitely many) decompositions 
into pure states, say, 

P = Y2 Wk \ ^k){ipk\, 

k 

where Wk are the statistical weights ( Vfc : Wk > 
0; Efc^fc = 1 )■ Then 

p(|<#.p)=I>*M (k*X^*l) 1$. 

fc 

This finally gives 

p(|0),p) =(0|p|0), (23) 

extending the same probability rule to mixed isolated 
states. (It is obvious that the choice of the above 
decomposition into pure states is immaterial. One can 
take the spectral decomposition e. g.) 



IV. RELATION TO THE LITERATURE 

This article comes after 8 studies of thought-provoking 
analiticity [J, [3, [H, 0, @, 0, Q, H on Zurek's 
derivation of Born's rule. It has profited from most of 
them. 

The purpose of this section is not to review these arti- 
cles; the purpose is to contrast some ideas from 5 of these 
works with the present version in order to shed more light 
on the latter. 

A. SCHLOSSHAUER-FINE 

For the purpose of a logical order in my comments, I'll 
mess up the order of the quotations from the article of 
Schlosshauer and Fine on Zurek's argument Q. 

Schlosshauer and Fine are inspired to define the pre- 
cise framework for Zurek's endeavor and try to justify it 
saying (DISCUSSION, (A)): 



"Apart from the problem of how to do cos- 
mology, we might take a pragmatic point of 
view here by stating that any observation of 
the events to which we wish to assign prob- 
abilities will always require a measurement- 
like context that involves an open system in- 
teracting with an external observer, and that 
therefore the inability of Zurek's approach to 
derive probabilities for a closed, undivided 
system should not be considered as a short- 
coming of the argument." 

This may well be the case. In the present version, one 
views the probability rule as a potential property of the 
system. Measurement is something separate; it comes af- 
terwards when an observer wants to get cognizance of the 
probabilities. The present study is an attempt to view 
Zurek's argument in such a setting of ideas. Incidentally, 
in the present version one can no longer speak of an "in- 
ability of Zurek's approach to derive probabilities for a 
closed, undivided system". 

Besides, the "problem of how to do cosmology" is con- 
sidered by many foundationally minded physicists to be 
an important problem in modern quantum-mechanical 
thinking. After all, interaction with the environment and 
decoherence that sets in (a phenomenon to which Zurek 
gave an enormous contribution) is primarily observer- 
independent (though it may contain an observer), and it 
fits well into quantum cosmology. The present study en- 
visages Zurek's argument in a measurement-independent 
and observer-independent way. 

In their CONCLUDING REMARKS Schlosshauer and 
Fine say: 

"...a fundamental statement about any prob- 
abilistic theory: We cannot derive probabil- 
ities from a theory that does not already 
contain some probabilistic concept; at some 
stage, we need to "put probabilities in to get 
probabilities out". 

In the present version of the theory, a realization of 
this pessimistic statement can be seen in the assump- 
tion that local probabilities exist at all (in the first 
stipulation, (b)), and in the application of additivity 
(and cr-additivity) of probability (the third stipulation). 
Incidentally, the quoted claim of Schlosshauer and Fine 
is perhaps only mildly pessimistic [2l| 

As a counterpart of the stipulations in the present ver- 
sion, Schlosshauer and Fine state (near the end of their 
INTRODUCTION): 

"...we find that Zurek's derivation is based at 
least on the following assumptions: 

(1) The probability for a particular outcome, 
i. e., for the occurrence of a specific value 
of a measured physical quantity, is identified 
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with the probability for the eigenstate of the 
measured observable with eigenvalue corre- 
sponding to the measured value - an assump- 
tion that would follow from the eigenvalue- 
eigenstate link. 

(2) Probabilities of a system S entangled 
with another system £ are a function of 
the local properties of S only, which are 
exclusively determined by the state vector of 
the composite system S£. 

(3) For a composite state in the Schmidt form 
\4>ss) = J2k \sk) |efe), the probability for 

Sfe) is equal to the probability for | e^). 

(4) Probabilities associated with a system 
S entangled with another system £ re- 
main unchanged when certain transforma- 
tions (namely, Zurek's "envariant transforma- 
tions" ) are applied that only act on £ (and 
similarly for S and £ interchanged)." 

Assumption (1) is very important. It is the quantum 
logical approach. (See the comment on it in section V.B 
.) Assumption (2) is reproduced in the present version as 
the first stipulation. 

Having in mind the above quotation on "putting in 
and taking out probability" , assumption (3) was care- 
fully avoided in the present version, which goes beyond 
the Schmidt decomposition. In the approaches that hang 
on to the decomposition, and all preceding ones are such, 
putting in probability where it is equal to 1 seems un- 
avoidable. 

As to assumption (4), it is, to my mind, the basic 
idea of Zurek's argument. Though Schlosshaucr and 
Fine "consider Zurek's approach promising" (INTRO- 
DUCTION), they feel very unhappy about this basic as- 
sumption (DISCUSSION,' F2): 

" . . .we do not see why shifting features of £ , 
that is, doing something to the environment, 
should not alter the "guess"... an observer of 
S would make concerning 5-outcomes. 

Schlosshauer and Fine point to Zurek's desire to bol- 
ster his argument by a subjective aspect with an observer 
who observes only subsystem <S, but who is aware 
of the composite state vector | , i')s£- This observer 
" makes guesses" and " attributes likelihood" to state vec- 
tors | (j))s- Schlosshauer and Fine make critical com- 
ments on this aspect. 

Weighing if the subjective aspect at issue is useful or 
even justified is avoided in the present version. It was 
assumed that Zurek's argument can do without it (cf the 
comment on Caves's first-quoted remark about this). 

Schlosshauer and Fine finish the quoted passage say- 
ing: 

"Here, if possible, one would like to see some 
further argument (or motivation) for why the 



probabilities of one system should be immune 
to swaps among the basis states of the other 
system." 

Apparently, locality or subsystem-property is a basic 
stipulation (the first stipulation in the present version), 
i. e., the basic idea how Zurek envisages probability. 
Naturally, one may object that it is hindsight, because 
we know the probability rule, and it implies the locality 
idea. 

When thinking of quantum ideas without the prob- 
ability rule, as Zurek does, why not try to insert into 
them a local probability idea? The motivation lies in our 
intuitive expectation to find nature with as many local 
properties as possible (to enable us to do physics). After 
all, the well known tremendous reaction of the scientific 
community to Bell's theorem dealing with subquantum 
locality is an impressive indication of how important lo- 
cality is considered to be. 

Envariance, or twin unitaries in the present equivalent 
formulation, (and broader, see the second stipulation) 
provide us with a means to define what it means "local" 
or a " subsystem property" when the reduced density op- 
erator is devoid of physical meaning to begin with, and 
we do not know what the state of the subsystem is. The 
two subsystems S and £ are remote from each other. 
This means that they cannot dynamically influence each 
other. To put it in more detail, no ancilla (or measur- 
ing instrument) interacting with subsystem £ can have 
any dynamical influence on the opposite subsystem S. 

Now, isn't it natural to stipulate with Zurek, that 
subsystem or local properties of S are those properties 
that cannot be changed by "doing something" to the 
opposite subsystem (action of an ancilla included), or 
otherwise the property would be global? (It might be 
useful to point out that the essential role of locality in 
Zurek's derivation is made clear also in his "facts" (cf 
the sixth quotation in subsection IV. C), especially in 
fact 2.) 

As to the parenthetical final remark of Schlosshauer 
and Fine in assumption (4) (of the third quotation), 
the present version did not make use of "interchanged" 
roles of S and £. Entanglement "treats" the two 
subsystems in a symmetrical way. So the interchange is 
quite all right, but it was felt, in expounding the present 
version, that it was unnecessary. 

Schlosshauer and Fine say (DISCUSSION, (G)): 

"According to Zurek, ...the observer is aware 
of the "menu" of possible outcomes..." 

In the present version, one is after a local probabil- 
ity rule and, to start with, one has no other idea what 
"local" means, except what envariance gives. Gradually, 
one endows the reduced density operator of the subsys- 
tem with the known standard physical meaning. It seems 
that this gradual building up knowledge of what " local" 
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means for probabilities is in Zurek's wording handled by 
the imaginary observer to whom, besides 1 1 i')s£, only 
the subsystem S is accessible. But what is the "sub- 
system" ? The state space TLs and the state vectors in 
it are all that is at the imaginary observer's disposal and 
at ours to start to build the "subsystem" notion. This 
is Zurek's "menu" (in the understanding of the present 
author) . 

Perhaps, one should stress that, if one envisages 
probability as a potentiality, as it is done in the 
present approach, then it seems natural to take in 
the "menu" all state vectors | 4>)s' : not just those 
that are eigen-vectors of the reduced density operator 
ps, which, at the beginning, has almost no physical 
meaning. ("Almost" is inserted in view of the Second 
Theorem on twin unitaries.) Contrariwise, if one 
envisages probabilities in the process of measurement 
(or observation), as Zurek does (and his commentators 
follow him), then taking the Schmidt decomposition is 
the suitable procedure. In the present version, this is 
avoided (except in the mathematical interlude, in de- 
riving the properties of twin unitaries in subsection II. A). 

In the last passage of the DISCUSSION of 
Schlosshauer and Fine the basis of the opposite 
subsystem that appears in the Schmidt decomposition is 
subjected to though-provoking critical comments. This 
is one of the reasons why the present version kept clear 
of the Schmidt decomposition. 

As to the eigenvalue-eigenstatc link given in assump- 
tion (1) (third quotation), Schlosshauer and Fine say 
(DISCUSSION, (C)): 

" Clearly, from the point of view of observa- 
tions and measurements, we would like to as- 
sign probabilities to the occurrence of the spe- 
cific values of the observable O that has 
been measured, i. e.,to the "outcomes". The 
eigenvalue-eigenstate link of quantum me- 
chanics postulates that a system has a value 
for an observable if and only if the state of 
the system is an eigenstate characteristic of 
that value (or a proper mixture of those eigen- 
states)." 

In the preceding section it was assumed that events are 
represented by projectors. This is the quantum logical ap- 
proach (because projectors can be interpreted as events, 
properties or logical statements), in which the projec- 
tors are more elementary than observables. (Mathemat- 
ically, one constructs Hermitian operators out of projec- 
tors using the spectral theorem.) Physically, the yes- 
no experiments carry the essence of quantum mechanics. 
The quantum logical approach is resumed in subsection 
V.B(a). (Zurek, in his Phys. Rev. paper, seems to be 
trying to take a more general approach: he is dealing 
with potential future records.) 



On the other hand, observables and their eigenval- 
ues ("outcomes") are the standard or textbook start- 
ing point for probabilities. Utilizing the eigenvalue- 
eigenstate link, leading to the quantum logical stand- 
point, is a choice of approach, which has to be justified 
in the end. Namely, when the probability rule is finally 
available, the eigenvalue-eigenstate link is a theorem: A 
state (density operator) p has the sharp value o of an 
observable O if and only if (i) the former is an eigen- 
value of the latter and (ii) p, when written as any mix- 
ture (possibly a trivial one)e states, it consists only of 
eigen- states of O corresponding to this eigenvalue (cf 
the Introduction in [22|). 

Finally, it should bepointed out what has been taken 
over from the article [2j of Schlosshauer and Fine. The 
second quotation led to caution concerning "putting in" 
as little probability as possible. It was the reason for 
avoiding the use of the Schmidt decomposition and hence 
also assumption 3 (in the third quotation) . The last quo- 
tation gave rise to thoughts about the non-contextuality 
involved (cf subsection V.B). 

B. Barnum 

In what follows a few comments in connection with 
Barnum's reaction [3J] to Zurek's derivation of probability 
will be given. 

Barnum says (p. 2, left column): 

"In our opinion, the version of Zurek's argu- 
ment we give below does not depend crucially 
on whether measurement is interpreted in 
this way (relative state interpretation, F. EL), 
or as involving "collapse", or in some other 
way (for example as involving "collapse" of 
our knowledge, say in a process similar to 
Bayesian updating [23j)." 

Hopefully, also the version of Zurek's argument ex- 
pounded in the preceding section is independent of the 
existence or non-existence of objective "collapse" in na- 
ture. (As to purely subjective "Bayesian updating", it 
is hard to see what one can update if nothing happened 
in nature. Let us be reminded of John Bell's famous 
dictum: "Information? Whose information, information 
about what?" But, some of us may just be incorrigible 
realists, "whatever realism means" - as the late Rudolph 
Peierls used to say.) 

Assuming the existence of objective collapse, there are 
two remote effects due to entanglement: distant measure- 
ment |11| . or more generally, remote ensemble decompo- 
sition [!| , and remote preparation [24[ , [2f| , @ (the selec- 
tive aspect of the former). It all started with Schrodinger 
[H| , who pointed out that doing a suitable selective mea- 
surement on subsystem 2, one can " steer" (his word for 
remote preparation) the remote system 1 into any state 
| <j)) i that is an element of the range of p\ , but with a 
certain positive probability. (Schrodinger assumed that 
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the range was finite dimensional. This was extended to 
| <f))x 6 7\L(/9^ 2 ) in 25] for infinite dimensional ranges, 
and the maximal probability, i. e., the best way to do 
remote preparation, was evaluated recently @.) 

Neither Schrodinger (26[, [24J, nor anyone in the Bel- 
grade group who worked on his program of " disentangle- 
ment" [ll[, P3> 01 nas ever, to the best of the present 
author's knowledge, tried to utilize remote preparation 
for an argument of probability because this would be 
"putting probability in to get probability out" (cf the 
second quotation in the preceding subsection), i. e., an 
evidently circular argument. 

It is a beauty of Zurek's argument that envariance, 
or remote unitary operation if one takes twin unitaries 
(the other face of envariance), has no probability at the 
start. It is deterministic: You perform a [7 2 local 
transformation on the opposite subsystem, and ipso 
facto one gets deterministically the transformation U± 
on the subsystem that is investigated. So, Zurek seems 
to be quite right that this concept can be used to shed 
light on the quantum probability notion (as far as it is 
assumed to be local). 

One gets the impression that Barnum feels that his 
insistence on no signalling and symmetric roles that S 
and £ should play is an important improvement on 
Zurek's argument. In particular, Barnum says (p. 2, 
right column): 

"Perhaps, however, there is a stronger ar- 
gument for no S-to-£ signalling in rela- 
tive state interpretation. On such an inter- 
pretation, once macroscopic aspects of £ 
have been correlated with S (the system 
has been "measured" by an observer who is 
part of £ ), the ability to affect probabili- 
ties of components of the state in subspaces 
corresponding to those distinct macroscopic 
aspects of £, by manipulating S, jeopar- 
dizes the interpretation of these numbers as 
"probabilities" at all. ... (within a generally 
subjectivist approach to probability in its as- 
pect as something to be used in science and 
everyday life..., an approach to which I am 
rather partial),..." 

Barnum is, of course, consistent. The purpose of 
quoting this passage is mostly to underline the difference 
in the approaches to Zurek's argument by Barnum and 
the present version. Namely, in the latter an attempt is 
made to keep the remote influence in one direction only, 
as Zurek originally did. Not because Barnum appears 
to be wrong; it is because the one-direction approach is 
considered simpler. There is another difference: Barnum 
says to be partial to subjectivism, and the present 
author has confessed above to be a realist. (This is not 
in the sense to negate or underestimate subjectivism. 
But the latter is understood by the present author as 



subjective cognizance of objective reality.) 

Barnum says (p. 3, both columns): 

"...if the joint state S£ is viewed as the 
outcome of a measurement "in the Schmidt 
basis" on S, by an environment £ that 
includes the observer, whose "definite mea- 
surement results" line up with the Schmidt 
basis for £, ascribing probabilities to these 
suffices for ascribing probabilities to " definite 
measurement results" ..." 

Also Schlosshauer and Fine pointed to this feature 
of Zurek's argument of "putting in probability" in £ , 
and "getting out" probability in S (cf the second 
quotation and assumption 3 in the third quotation in 
the preceding subsection). Apparently, Zurek "puts in" 
no more than (probabilistic) certainty. This certainly is 
not circularity. Nevertheless, the present version takes 
another route. 

There is another aspect of the present version that it 
shares with Zurek's original one. It is assuming non- 
contextuality. But let us first see what Barnum says on 
the subsject (p. 3, right column): 

"Note that we have not yet established that, 
for a given state, the probabilities of com- 
ponents in subspaces are independent of the 
subspace decomposition in which they occur, 
an assumption similar to that made in Glea- 
son's theorem, and which might allow us to 
use Gleason's theorem as part of an argument 
for quantum probabilities. Of course, a po- 
tential virtue of the argument from envari- 
ance is precisely that it does not make any 
such assumption to begin with." 

One is here on quantum-logical grounds. Quantum- 
logical non-contextuality means, in the understanding of 
the present author, that if F is a composite event (the 
projector project onto a more-than-2 dimensional sub- 
space), then no matter in which of the infinitely many 
possible ways F is written as a sum of mutually exclu- 
sive (orthogonal) elementary events (ray projectors), and 
defined in this way, the probability of F is one and the 
same. This is so on account of c-additivity. (See also 
the discussion in subsection V.B(a)). 

It is hard to see how one can avoid the quantum-logical 
non-contextuality in Zurek's argument. Namely, when 
one wants to evaluate the probabilities of the equally 
probable states \ <fi)i that correspond to one and the 
same eigenvalue of p\ (stage one in the preceding sec- 
tion), one cannot avoid using additivity. Besides, also 
in the evaluation of the probability of the eigen-event 
Qi (the range projector) when p\ has only one pos- 
itive eigenvalue requires the use of additivity (and the 
zero-probability assumption, cf the third and the fourth 
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stipulations in the preceding section). Then, as it was 
argued in the preceding passage, quantum-logical non- 
contextuality has been utilized. (More on this in subsec- 
tions V.B and V.E. See also subsection V.F.) 

Gleason gives the complete answer (cf subsection 
V.F). Then what is the point of Zurek's argument? I'll 
attempt an answer to this worrisome question in the 
concluding comments in the next section (see subsection 
V.F). 

After the quoted passage, Barnum writes about, what 
he calls, the Perfect Correlation Principle. From the 
point of view of the Belgrade group, he talks about 
twin observables (cf subsection B on twin Hermitians 
in section II.): The measurement of any subsystem 
observable that is compatible (commuting) with the 
corresponding reduced density operator is ipso facto 
also a measurement (so-called distant measurement) of 
a twin observable on the opposite subsystem. 

Barnum further says, speaking of Stan and Emma 
instead of subsystems, and applying his S — ► £ no- 
remote-influence ("no signalling") approach (p. 3, right 
column) : 

"Whether or not Stan measures anything 
should be immaterial to Emma's probability, 
by no-signalling." 

Twin Hermitians are mathematically very closely con- 
nected with twin unitaries (subsection B in section II.). 
Distant measurement can make non-contextuality very 
plausible for suitable, i. e., with the reduced density op- 
erator compatible, subsystem observables. But distant 
measurement is derived from the probability rule in quan- 
tum mechanics. This way one cannot avoid circularity. 

Subsystem observables not compatible with the cor- 
responding density operator do not give rise to distant 
measurement; they cause distant ensemble decomposi- 
tion (see @). Here we are outside envariance, i. e., we 
are using subsystem unitaries (in the sense of subsection 
II. B) that do not have a twin. 

On his page 5, left column, Barnum discusses at length 
Zurek's assumption of continuity of probability as a func- 
tion of ps ■ Among other things, he says: 

"It is not clear to us why one would rule out 
discontinuous probability assignments even 
though they may seem "pathological"." 

In the preceding section "continuity" entered as the 
fifth stipulation. It has led, in the end, to the quantum 
probability rule. The argument presented leaves open 
the possibility that also probability that is not contin- 
uous in p might exist. But we know from Gleason's 
theorem that, though he does assume continuity in 
the projectors (via c-additivity as a strengthening 
of additivity, cf subsection V.E), he does not assume 



continuity in p. Thus, probability discontinuous in p 
does not seem to exist. 

The present author is especially indebted to Barnum 
for his useful suggestion about how to extend Zurek's ar- 
gument to state vectors | <f>) \ that are not eigenvectors 
of p\. He suggested (in private communication): "Per- 
haps one could get somewhere by making assumptions 
about probabilities zero and one..." This fitted in well 
with the theorem from previous work on the closest suit- 
able state, i. e., state of zero and one probabilities (cf the 
sixth stipulation in section III of this article and relation 

Finally, it should be stated what is the main insight 
gained from the article 3] of Barnum. It confirmed 
the suspicion, stemming from Zurek's writings, that 
the concrete idea of system and environment can be 
generalized to any entangled subsystems. (Stan and 
Emma achieve this.) The continuity assumption is not 
as trivial as one might think. Barnum made me give a lot 
of thought to the quantum-logical non-contextuality (cf 
subsection V.B(a)), and the relation between Gleason's 
theorem and Zurek's argument (cf subsection V.F). 



C. Zurek's most mature article on envariance 

Zurek in his most mature, Physical Review, article 
[l6| takes into account the comments of Schlosshauer 
and Fine and Barnum. The exposition of the preceding 
section will now be put in relation to Zurek's original 
argument presented there. (Quotations will be taken 
from pages in the archive copy, version 2.) 

In the abstract Zurek says: 

"Probabilities derived in this manner (he 
means from envariance, F. H.) are an objec- 
tive reflection of the underlying state of the 
system - they represent experimentally veri- 
fiable symmetries, and not just a subjective 
"state of knowledge" of the observer." 

In the present version, one confines oneself to this atti- 
tude of the founder of envariance, though he finishes the 
abstract as follows. 

"Envariant origin of Born's rule for proba- 
bilities sheds a new light on the relation be- 
tween ignorance (and hence information) and 
the nature of quantum states." 

On p. 1, left column he completes this thought as 
follows: 

" The nature of " missing information" and the 
origin of probabilities in quantum physics are 
two related themes, closely tied to its inter- 
pretation." 
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One cannot but fully agree with this. The subjective 
side of Zurek's argument has, nevertheless, been disre- 
garded in the present version because considerably more 
than the basic quantum formalism has been made use of 
in it (unlike in the preceding versions), and, hence, it is 
quite intricate as it is. 

On p. 1, left column, Zurek says: 

"We shall, however, refrain from using 
"trace" and "reduced density matrix". Their 
physical significance is based on Born's 
rule...., to avoid circularity..." 

In contrast to Zurek's original version, in the present 
one not only that "trace" and "reduced density matrix" 
are not avoided, they are the mathematical starting 
point. Admittedly, they are at the start physically 
devoid of meaning. But the second theorem on twin 
unitaries (the other face of envariance) in subsection A 
of section II. discloses the relevance of these concepts 
for envariance. Since one of the basic ideas of Zurek 
is that the probabilities in the system S are local, 
and we do not have the reduced density matrix p$ 
determining the subsystem state and thus defining 
locality, it appears natural to use envariance (twin 
unitaries) for the definition of what is local. Then, the 
mathematical notion of the reduced density matrix turns 
out to be relevant, and gradually, taking the steps of 
Zurek's argument, the reduced density matrix becomes 
endowed with the standard physical meaning. 

At the beginning of his argument, on p. 2, right 
column, Zurek lines up the basic assumptions of " bare" 
quantum mechanics (or quantum mechanics without 
collapse): that the universe consists of systems, each 
of which has a state space; that the state space of 
composite systems are tensor products; and that the 
unitary dynamical law is valid. (See also Zurek's three 
spelled out "Facts" - the sixth quotation below.) All 
these were tacitly assumed in section III. 

At the beginning of the left column, p. 3, Zurek says: 

"We shall call the part of the global state that 
can be acted upon to affect such a restoration 
of the preexisting global state the environ- 
ment £ . Hence, the environment- assisted 
invariance, or - for brevity - envariance. We 
shall soon see that there may be more than 
one such subsystem. In that case we shall use 
£ to designate their union." 

It appears that Zurek envisages, actually, more-or-less 
the whole universe , or at least, a large part of it 
containing all systems that have ever interacted with 
the subsystem S at issue. In contrast to this, the 
version of the argument in section III laid emphasis 
on the existence of entanglement with any opposite 



subsystem (but cf subsection V.D). Any larger system 
(1 + 2) in any entangled state | x P}i2 that has one 
and the same local or first-subsystem probability would 
do. Since subsystem 2 is arbitrary, it can also be the 
environment as Zurek envisages it. 

On p. 4, left column, Zurek lists three "facts", which 
he considers basic to his approach. 

"Fact 1: Unitary transformations must act 
on the system to alter its state. (That is, 
when the evolution operator does not operate 
on the Hilbert space Hs of the system, i. 
e., when it has a form . . . (g> I5 ® . . . the 
state of S remains the same.) 

Fact 2: The state of the system S is 
all that is needed (and all that is available) 
to predict measurement outcomes, including 
their probabilities. 

Fact 3: The state of a larger composite sys- 
tem that includes S subsystem is all 
that is needed (and all that is available) to 
determine the state of the system <S." 

Zurek adds "... the above facts are interpretation- 
neutral and the states (e. g., 'the state of S ') they 
refer to need not be pure." 

I find Zurek's "facts" fully acceptable, and I have tac- 
itly built them into the present approach (like the above 
basic assumptions of the no-collapse part of quantum 
mechanics). Actually, his broad "state" concept helped 
me to decide to stick to the reduced density operator 
pi, the physical relevance of which is suggested by the 
two theorems on twin unitaries in subsection II. A. As it 
could be seen in section III, Zurek's argument enables 
one to endow the mathematical concept of the reduced 
density operator gradually with the standard physical 
meaning yielding the quantum probability rule. 

On p. 4, left column, Zurek says: 

"Indeed, Schmidt expansion is occasionally 
defined by absorbing phases in the states 
which means that all the non-zero coefficients 
end up real and positive ... . This is a dan- 
gerous oversimplification. Phases matter... ." 

Zurek is, of course, quite clear about the role of 
canonical Schmidt decomposition (see section II. A 
above). What he means, I believe, is that one must be 
careful about phases in any expansion of the global state; 
one can disregard them only after a careful analysis 
as the one he presents. Since the present version goes 
beyond the Schmidt decomposition, it turned out that 
the separate question of phases actually does not come 
up. 

On the other hand, one can fully accept his words (p. 
4, bottom of right column): 
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"Lemma 3 we have just established is the cor- 
nerstone of our approach." 

His Lemma 3 is about envariant swaps of orthogonal 
first-subsystem eigenstates of pi, and, later in his 
Theorem 2., it implies their equal probability. In 
methodological contrast to Zurck's Lemma 3, in section 
III above the second theorem on twin unitaries (section 
II. A) was used to establish equal probability of any two 
state vectors in one and the same eigensubspace of pi . 
But, this is, of course, equivalent to Zurck's Theorem 2. 

On p. 5, left column, Zurck gives a very nice discussion 
of the complementarity between knowledge of the whole 
and knowledge of the part - complementarity of global 
and local due to entanglement . There was no need to 
enter this in the present version. 

On p. 7, right column, Zurek says: 

"Let us also assume that states that do not 
appear in the above superposition (i. e., ap- 
pear with Schmidt coefficient zero) have zero 
probability. (We shall motivate this rather 
natural assumption later in the paper.)" 

This is the fourth stipulation in section III. This is 
" rather natural" when we already know the quantum rule 
of probability. In Zurek's setting of no such knowledge, 
it appears to come out of the blue. But a stipulation can 
do this. 

Zurek resumes this question on p. 19, left column, con- 
sidering a rather intricate composite state "representing 
both the fine-grained and the coarse-grained records". 
He essentially describes observation or measurement in 
my understanding. He says: 

"The form of ... (the composite state, F. H.) 
justifies assigning zero probability to ... (state 
vectors of the system, F. H.) that do not ap- 
pear, - i. e., appear with zero amplitude - in 
the initial state of the system. Quite simply, 
there is no state of the observer with a record 
of such zero-amplitude Schmidt states of the 
system ... (in the composite state, F. H.)." 

This is convincing in the context of Zurek's objective 
probabilities - as he calls them. If probability is treated 
as a potentiality, no matter if it will be ever measured or 
not, as it is in the present approach, then one had better 
not use this argument. (It is used only as a plausibility 
justification in the present version.) 

On p. 7, right column, Zurek says: 

"Moreover, probability of any subset of n 
mutually exclusive events is additive. ... We 
shall motivate also this (very natural) as- 
sumption of the additivity of probabilities 



further in discussion of quantum measure- 
ments in Section V (thus going beyond the 
starting point of e. g. Gleason ...)" 

Zurek has stated (on p. 5, left column) that he will use, 
besides envariance, also "a variety of small subsets of nat- 
ural assumptions". At this place of his text, it appears 
that additivity of probability is one of them. Actually, 
it is a very strong assumption on the quantum-logical 
ground(cf the discussion of this in subsections V.B(a) 
and V.E). One can accept that the measurement context 
makes it more plausible, but it still is an extra assump- 
tion. 

Zurck resumes this question on pp. 18 and 19. He is 
at pains to derive " additivity of probability from envari- 
ance" . He says: 

"To demonstrate Lemma 5 (a key step in 
his endeavor, F. H.) we need one more prop- 
erty - the fact that when a certain event U 
{p{U) = 1) can be decomposed into two mu- 
tually exclusive events, U = k V k , their 
probability must add up to unity: 

p{U) = p(k V fc- 1 ) = p{k) +p(k ± ) = 1. 

This assumption introduces (in a very lim- 
ited setting) additivity. It is equivalent to 
the statement that "something will certainly 
happen" ." 

We have discussed above the Schlosshauer and Fine 
comment " you put in probability, to get out probability" . 
Zurek's just quoted passage looks somewhat similar: 
you put in additivity, to get out additivity (though you 
put it in "in a very limited setting", but at the crucial 
place). This question is resumed in detail in subsection 
V.E. 

Zurek starts his subsection D. of section II. stating 
that he will "complete derivation of Born's rule" by 
considering the case of unequal absolute values of the 
coefficients in the Schmidt decomposition. Clearly, 
unlike section III of this paper, Zurek had no intention 
to go further than encompassing the eigenvectors of pi . 
In his terminology, that is "Born's rule". 

Zurck finishes section II., after he has discussed ratio- 
nal moduli of Schmidt coefficient (which has been com- 
pletely taken over in section III above) saying: 

"This is Born's rule. The extension to the 
case where \ak\ 2 (the moduli, F. H.) are 
incommensurate is straightforward by conti- 
nuity as rational numbers are dense among 
reals." 

This seems to be another of Zurek's "natural assump- 
tions". In the present version, it was raised to the level 
of a stipulation following the convincing discussion of 
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Barnum (cf the last quotation and the last passage in 
the preceding subsection). 

Zurek's section V is devoted to a rederivation of Born's 
rule from envariance. In his section II. the environment 
£ could and needed not contain an observer. He didn't 
actually make use of him. In section V the observer is 
explicitly made use of (consistent with, e. g., the relative- 
state theory of Everett [27j). One gets the feeling that 
this exposition, in which it is explicit that Zurek is after 
probability in the process of measurement (or observa- 
tion), is more convincing and successful. 

In the present version, measurement is " off limits" (as 
Zurek would say) . Twin unitaries (the other face of en- 
variance) are a direct consequence of entanglement (cf 
subsection II. A of this article). In the present version, 
Zurek's argument was treated as strong enough to carry 
out the complete program: quantum probability rule 
from entanglement, treating the former as a potential- 
ity. This standpoint is, apparently, in keeping with the 
following passage of Zurek's paper. 

On p. 23, left column, Zurek says: 

"...even when one can deduce probabilities a 
priori using envariance, they better be consis- 
tent with the relative frequencies estimated 
by the observer a posteriori in sufficiently 
large samples. ... We shall conclude that 
when probabilities can be deduced directly 
from the pure state (he means | 1 $ , )ss, F. 
H.), the two approaches are in agreement , 
but that the a priori probabilities obtained 
from envariance-based arguments are more 
fundamental." 

Precisely so! Because probabilities are an a priori 
notion, and "more fundamental" than the relative 
frequencies, in terms of which they are measured, the 
probabilities should be treated as a potentiality. 

Finally, it is needless to state what has been learn't 
from Zurek. The entire theory is his. The rest of us 
are only conjuring up different variations on it to gain a 
deeper grasp of the matter. 



D. Mohrhoff 

I'll begin with the abstract of Mohrhoff 's paper [i[ on 
Zurek's "Born's rule from envariance" argument, which 
lacks Zurek's Physical Review paper (discussed in the 
preceding subsection), and both Barnum's article and the 
one of Caves in its references. Mohrhoff says: 

"Zurek claims to have derived Born's rule 
noncircularly. . . from deterministically evolv- 
ing quantum states. ... this claim is exag- 
gerated if not wholly unjustified. ...it is not 



sufficient to assume that quantum states are 
somehow associated with probabilities and 
then prove that these probabilities are given 
by Born's rule." 

Mohrhoff calls in question the, as he puts it, "so-called 
derivation" of Born's rule. Strictly logically, " derivation" 
of a claim means that the claim is a necessity. Now, 
probabilities are a necessity in a deterministically evolv- 
ing universe from a physical point of view as made clear 
in section V of Zurek's Phys. Rev. paper. But logically, 
Mohrhoff is right that one assumes the existence of 
probabilities, and then one finds out what they look like. 
The present version is certainly not better than that. 

Mohrhoff even strengthens his critical attitude on p. 
4 (the archive version is taken) after having shortly re- 
viewed Zurek's argument: 

"What is thereby proved is that if quan- 
tum states are associated with probabilities 
then Born's rule holds. But how do quan- 
tum states come to be associated with prob- 
abilities? As long as this question remains 
unanswered, one has not elucidated the origin 
of probabilities in quantum physics, as Zurek 
claims to have done." 

In spite of Zurek's wording in expounding his argu- 
ment, he does not appear to be claiming to have answered 
Mohrhoff's "question"; the present version certainly has 
not. One becomes pessimistic at this point, and one is 
inclined to partially agree with Mohrhoff's first sentence 
in his Introduction: 

"In any metaphysical framework that treats 
quantum states as deterministically evolving 
ontological states, such as Everett's many- 
worlds interpretation, Born's rule has to be 
postulated." 

Zurek's derivation of Born's rule suggests that this 
claim should be weakened be replacing "Born's rule" in 
it by "probability". 

In the following quotation (bottom of p. 6), Mohrhoff 
hits at the very foundation of Zurek's argument. 

"The rather mystical-sounding statement 
that knowledge about the whole implies igno- 
rance of the parts (he means complementar- 
ity of global and local, F. H.) is thus largely a 
statement about correlated probability distri- 
butions over measurement outcomes. Given 
its implicit reference to probabilities, it does 
not elucidate the " origin of probabilities" but 
rather shows that probabilities are present 
from the start, however cleverly they may be 
concealed by mystical language." 
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As far as correlated probability distributions are con- 
cerned, Mohrhoff has a point. Indeed, the remote effects, 
which can be, in principle, either immediately confirmed 
by coincidence measurement or subsequently by a suit- 
able measurement on the opposite (remote) subsystem, 
are observationally nothing else than correlated probabil- 
ities. 

Does this ruin Zurek's argument? I think not at all. 
Complementarity of global and local is a well known 
fact. Besides, entanglement should be understood as 
another peculiar potentiality, which can lead to the 
potentiality of probability. After all, the latter is what 
Zurek is after (at least as it is understood in the present 
version). Hopefully, these potentialities are not just 
" mystical language" " concealing" the true state of affairs 
(cf subsection V.C). 

Mohrhoff's rejection of Zurek's argument is rather 
deep-rooted. On p. 7 he says: 

"To my mind, the conclusion to be drawn 
from the past failures (including Zurek's) to 
derive probabilities noncircularly from de- 
tcrministically evolving ontological quantum 
states, is that quantum states are probabil- 
ity measures and should not be construed as 
evolving ontological states. Theorists ought 
to think of them the way experimentalists 
use them, namely, as algorithms for comput- 
ing the probabilities of possible measurement 
outcomes on the basis of actual measurement 
outcomes." 

It seems that Mohrhoff has accepted Bohr's stand- 
point that ontology in quantum physics is metaphysics, 
i. e., beyond physics, perhaps philosophy. Mohrhoff has 
even strengthened Bohr's rejection of a nowadays rather 
widely accepted ontology speaking of " pseudophysics" 
(or false physics). He seems to be, what one sometimes 
calls, an "instrumentalist" believing only in the reality of 
the laboratory instruments; the rest is "mystical 

language" [2g|. This calls to mind Mermin's, perhaps 
somewhat unjust, nickname for such a standpoint: "the 
shut up and calculate interpretation of quantum mechan- 
ics" (cf the article by Schlosshauer and Fine). 

Though Mohrhoff stands at the farthest from the 
ontological standpoint of Zurek and the rest of his com- 
mentators (including the present author), his criticism 
and objections should be taken seriously. After all, 
ontology is also a potentiality; if one does not believe in 
it, you can't prove it. 

Finally, let it be stated what has been learnt from 
Mohrhoff's article. His scepticism about the non- 
circularity of Zurek's argument (cf the first quotation, 
and especially the second one) helped to decide to try 
to treat probability as a potentiality (without any mea- 
surement or observation). Next, following Mohrhoff's 
explicit warning (see his third quotation), the present 



version postulates the existence of probability (as part 
of the first postulate). Mohrhoff's uncompromising 
attitude is a challenge that has led to an attempt to put 
Zurek's argument in a transparently non-circular way. 
To what extent the present version has succeeded in this 
will be discussed again in the next section (cf subsection 
V.C). 



E. Caves 

Caves' reaction Q to Zurek's argument appeared with 
all the references that have been commented upon so far. 

At the very beginning of his treatise, Caves reacts 
to the Phys. Rev. Letters version, and comments on 
Zurek's subjective standpoint saying: 

"It is hard to tell from WHZ's (Zurek's, F. 
H.) discussion whether he sees his derivation 
as justifying the Born rule as the way for an 
observer to assign subjective probabilities or 
as the rule for objective probabilities that ad- 
here within a relative state." 

Later on, Caves quotes the same as in my first quo- 
tation in the subsection on Zurek's Phys. Rev. paper, 
and decides that " WHZ is thinking in terms of objective 
probabilities" . In the present version the subjective side 
of the problem is completely omitted, but it should be 
emphasized that this is not because it is not considered 
important. 

Though sometimes it is hard to see one's way through 
Zurek's "underbrush of verbiage" (as Caves says for 
Barnum) in his copious expositions (the exposition in 
the present article is probably no better) , it is clear that 
Zurek's approach to fundamental problems is rather 
all-encompassing. In particular, he, no doubt, recognizes 
that no thorough ontology can disregard epistemology. 
But in the latter, the observer's cognition is a reflection 
of reality. When an observer cannot distinguish two 
envariantly swapable states, e. g., this means, that they 
are objectively indiscernible, i. e., equal, etc. (I am sure, 
Caves sees the work of Zurek in a similar manner, but he 
seems to object to the way how Zurek unfolds his ideas.) 

On p. 2, Caves starts with a simple (non-composite) 
system A, and a non-trivial observable for it. He 
then points out that Zurek considers the unitary 
evolution corresponding to interaction with an ideally 
measuring apparatus B. (Ideal measurement is not 
only a non-demolition one, i. e., result preserving, but 
also eigen-state preserving, and, of course, probability 
preserving.) This fits well into the sixth stipulation of 
the present version, in which the closest suitable state 
is the Liiders state corresponding precisely to ideal 
measurement. 

Caves further says on p. 2: 
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"Notice that what I am saying is that in 
WHZ's approach, it is the Schmidt relative 
state that defines the notion of outcomes for 
system A; without the entanglement with 
system B , one cannot even talk about out- 
comes for the basis {| at)} (the eigenbasis 
of the measured observable, F. H.)." 

Zurck "derives" probabilities from entanglement, and 
the latter he displays in terms of a Schmidt decomposi- 
tion. No re-definition of events takes place here. (One 
can read in Zurek's Phys. Rev. article a detailed dis- 
cussion on how events, pointer states, etc. emerge from 
correlations.) 

Caves further says (on the same page): 

"... it has already been assumed that the 
probabilities that he is seeking ... have no de- 
pendence on the environmental states | bk) 
(partners of |afe) in the Schmidt decompo- 
sition, F. H.). This is a kind of foundational 
noncontcxtuality assumption that underlies 
the whole approach. I will call it environ- 
mental noncontextuality for lack of a better 
name." 

This is an attempt to view Zurek's derivation from 
another angle. In section III of this article a rather 
different, though essentially equivalent view was pre- 
sented. Perhaps, one should be reminded of it. The 
probabilities in subsystem A (to use Caves' notation 
for the first subsystem), though defined by the bipartite 
entangled state \iP}ab, are actually locally determined. 
Then the rest of the argument goes on in utilizing 
twin unitaries (the other face of cnvariance) to find 
this local determination. Naturally, by the very fact of 
local determination of subsystem probability (the first 
stipulation), the details of the opposite subsystem (the 
environment) don't really matter. Therefore, no empha- 
sis was put on Cave's "environmental non-contextuality" . 

On p. 3 Caves says: 

"WHZ wants to view envariance as the key 
to his derivation, but it is just a way to 
write the consequences of environmental non- 
contextuality, when they provide any useful 
constraints, in terms of system unitaries, in- 
stead of environment unitaries. It turns out 
not to be necessary to translate environmen- 
tal non-contextuality to system unitaries for 
any of the steps in the derivation." 

The last statement seems to be the most important 
one in Caves' article; it appears to be the program of his 
version of Zurek's argument. And he carries it out in 
the rest of his paper. 

In Caves' version, as in all the other versions, Schmidt 
decomposition is adhered to as the only widely known 



way how to handle entanglement. As a consequence, it 
turns out indispensable to put some probability in the 
environment, to get out probability in the system. It is 
assumption (3) in the article of Schlosshauer and Fine; 
Barnum calls it the Perfect Correlation Principle (same 
as " twin observables" in the work of the Belgrade group) ; 
Zurek uses it and emphasizes that probability-one state- 
ments are put in; Caves accepts Barnum's term. It con- 
sists simply in equal probabilities of the partners in a 
Schmidt decomposition. Both Barnum and Caves make 
use of the environment in a way that is more than nec- 
essary from the point of view of the present approach. 
Namely, on p. 4 Caves says: 

"The point is that WHZ's derivation depends 
on an unstated assumption that one can 
interchange the roles of systems A and B 
in the case of Schmidt states with amplitudes 
of equal magnitude." 



In contrast to the rest of the authors of versions com- 
mented upon so far, Caves couldn't readily accept the 
suitable extension of the environment to reduce unequal 
Schmidt coefficients to equal ones. On p. 6 he says: 

"We were originally told that the very notion 
of outcomes for system A required us to 
think about a joint pure state with the ap- 
propriate Schmidt decomposition. Now we 
are told that the notion of outcomes requires 
us to think about a much more complicated 
three-system joint state, where the two ad- 
ditional systems must have a dimension big 
enough to accommodate the rational approx- 
imation to the desired probabilities. Does 
this mean the notion of outcomes depends on 
the value of the amplitudes? This is a very 
unattractive alternative, so what we really 
must think is that for all amplitudes, the no- 
tion of outcomes requires us to think in terms 
of a big three-system joint state, where B 
and C have arbitrarily large dimensions. 
We are now supposed to believe that the no- 
tion of outcomes for system A requires us 
to think in terms of two other systems corre- 
lated in a particular way, which has no appar- 
ent relation to the number of outcomes of sys- 
tem A. Even a relative-state believer would 
find this hard to swallow, and it makes the 
Perfect Correlations Principle assumption far 
less natural, because this construction wrecks 
the nice-looking symmetry between A and 
the systems to which it is coupled and even 
between AB and C. It is a heck of a lot 
less attractive than the original picture we 
were presented and really should have been 
stated at the outset." 
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This rebellious passage of Caves was of great help in 
realizing that one should not confine oneself to unitaries 
of the opposite system that have a twin for the system 
under consideration treating locality. Also broader 
opposite-subsystem unitaries cannot change what is 
local in the system (see the second stipulation in section 
III of this article), and hence are part of the definition 
of the subsystem state and local properties. Then 
interaction with a suitable ancilla, which takes place in 
terms of such a unitary comes natural, and subsystem 
A of the enlarged system A + BC that Caves is 
objecting to still has the same locality or subsystem 
state, and the same subsystem probabilities. 

Caves closes his consideration on p. 6 saying: 

" In the end one is left wondering what makes 
the envariance argument any more com- 
pelling than just asserting that a swap sym- 
metry means that a state with equal ampli- 
tudes has equal probabilities and then moving 
on to the argument that extends to rational 
amplitudes." 

One should bear in mind that the swap symmetry is 
equivalent to symmetry under the group of twin uni- 
taries, which is, in turn, equivalent to the essence of the 
envariance argument. 

Finally, it should be pointed out that the need for 
broader opposite-subsystem unitaries than just those 
U 2 that have a twin U\ (see the second stipulation 
in the present version) is not the only thing that has 
been learnt from Caves' article [![. His comments raised 
the question how to extend Zurek's argument to isolated 
systems. (A solution using continuity is presented in the 
present approach.) 



V. CONCLUDING REMARKS 

There are some points that require additional clarifi- 
cation and comment. 



A. Summing up the stipulations of the present 
version 

The FIRST STIPULATION is: (a) Though the given 
pure state | v E'}i2 determines all properties in the com- 
posite system, therefore also all those of subsystem 1, 
the latter must be determined actually by the subsystem 
alone. This is, by (vague) definition, what is meant by 
local properties. 

(b) There exist local or subsystem probabilities of all 
elementary events <fr) i {<p | i , (f>)i G 7i\. 

The SECOND STIPULATION is that subsystem or 
local properties must not be changeable by remote action, 
i. e., by applying a second-subsystem unitary Ui to 



^ ) 12 or any unitary [/23 applied to the opposite 
subsystem with an ancilla (subsystem 3 ). 

The most important part of the precise mathematical 
formulation of the second stipulation is in terms of twin 
unitaries (cf (8a)). No local unitary U\ that has a twin 
U2 must be able to change any local property. 

The er-additivity rule of probability is the THIRD 
STIPULATION. It requires that the probability of ev- 
ery finite or infinite sum of exclusive events be equal to 
the same sum of the probabilities of the event terms. 

The FOURTH STIPULATION: Every state vector 
4>)i that belongs to the null space of p\ (or, equiva- 
lently, when |0)i(<^>|i acting on | 12 , gives zero) has 
probability zero. (The twin unitaries do not influence each 
other in the respective null spaces, cf (9a, b). Hence, this 
assumption is independent of the second stipulation.) 

The FIFTH STIPULATION: the sought for probabil- 
ity rule is continuous in pi, i. e., if p\ = \an. n ^ >O0 p™, 
then p(Ei,pi,X) = \im n ^ oc p(E 1 , p\, X), for every 
event (projector) E\, and X stands for the possi- 
ble yet unknown additional entity needed for a complete 
local probability rule. Further we assume that X, if it 
exists, does not change in the convergence process. 

The SIXTH STIPULATION: Instead of p u of which 
the given state \ 4>)i is not an eigen-state, we take a 
different density operator p[ of which \ 4>}i is an 
eigenvector, i. e., for which p[ \4>)i — r' \<p)i is valid, 
and which is closest to p\ as such. We stipulate that 
the sought for probability is r' . 

Comparing the stipulations to Zurek's facts (sixth 
quotation in subsection IV. C), we see that facts 3 and 2 
strictly correspond to the first stipulation (a). (Fact 1 
is connected with answering the question in subsection 
V.G.) 

Let us compare the 6 stipulations with the 4 assump- 
tions of Schlosshauer and Fine (cf the third quotation 
from their article). Assumption (1) is not among the 
former, because I understand Zurek's starting point is 
quantum logical, and so is mine. Zurek does not seem to 
consider observables, and neither am I. 

Assumption (3) is avoided because of the possible sus- 
picion that it is "putting probability in" (cf the second 
quotation from Schlosshauer and Fine) though Zurek re- 
marks that it is no more than putting probability-one 
statements in. 

Three assumptions that, apparently, cannot be 
avoided, have been raised to the status of stipulations: 
that of c-additivity, that of null probability of the 
null-space vectors | 4>)\, and, finally that of conti- 
nuity. (The sixth stipulation in the present version is, 
of course, not covered by Schlosshauer and Fine be- 
cause they did not consider extending Zurek's argument.) 
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B. Non-contextuality in the quantum logical 
approach 

(a) The event non-contextuality. From the quantum 
logical point of view, the elementary events occur in only 
one way. There is no question of context. But on ac- 
count of the implication relation in the structure of all 
events (the projector E implies the projector F, i. 
e., E < F if and only if EF = E ) every composite 
event can occur as a consequence of the occurrence of 
different elementary events that imply it. Nevertheless, 
the probability does not depend on this. 

As a matter of fact, the probabilities of the composite 
events are in Section III of this article, following Zurek, 
defined in terms of mutually exclusive elementary events 
(orthogonal ray-projectors, each defined by a state vec- 
tor) using cr-additivity. 

(b) Non-contextuality with respect to observables. A 
given elementary (or composite) event can, in general, 
be the eigen-event (eigen-projector) of different ob- 
servables. (This, essentially, amounts to the so-called 
eigenvalue-eigen-state link.) Correspondingly, the event 
can occur in measurement of different observables. The 
probability of the event does not depend on this. 



C. Circularity? 

In the second quotation from the article of 
Schlosshauer and Fine, the curse of a " fundamental state- 
ment" that one cannot "get probability out" of a theory 
unless one " puts some probability in" should be valid also 
for the present version. It appears to be valid no more for 
the present version of Zurek's argument than for Glea- 
son's theorem. Namely, what both "put in" is the as- 
sumption that probability exists and that cr-additivity 
is valid for it. 

Let us return to Mohrhoff's attempt of a fatal blow 
at Zurek's argument in the last but one quotation from 
his article stating that entanglement itself is correlation 
of probabilities. Hence, using entanglement as a starting 
point means "putting probability in". No wonder that 
one " gets probability out" . 

One can hardly shatter Mohrhoff's criticism. It all 
depends on how much belief one is prepared to put in 
theory. Taking an extremely positivistic attitude, one 
can say that, e. g., "interference" is all that exists in 
the phenomenon when one sees it; "coherence" in the 
quantum mechanical formalism giving rise to interference 
is, according to such a point of view, just a part of the 
formalism without immediate physical meaning. 

If one decides, however, to allow some reality to 
theoretical concepts, then, in the case at issue, "en- 
tanglement" is a theoretical concept (the correlation 
operator in the present approach), a potentiality, which 
is believed to be real in nature. We can observe its 
consequence as correlation of probabilities, but it is more 



than that. 



D. The role of entanglement 

In the present version, entanglement enters through, 
what was said to be, the sole entanglement entity - the 
correlation operator U a (see the correlated subsystem 
picture in section II. A.). In terms of this entity the first 
theorem on twin unitaries (near the end of section II. A.) 
gives a complete answer to the question which unitaries 
have a twin, and which opposite-subsystem unitary is the 
(unique) twin. 

In section III, in unfolding the present version, the 
correlation operator (and hence entanglement) was not 
made use of at all. All that was utilized was the gen- 
eral form of a first-subsystem unitary that has a twin: 
Ui = Ej U iQi + UiQi, where U = £\ Q{ + Qi 
is the eigen-resolution of the unity with respect to (dis- 
tinct eigenvalues) of the reduced density operator pi ( = 

tr2 ( | ^E') 12 1 12 )^ j and Vj : U( is an arbitrary unitary 

in the eigen-subspace H(Q{) corresponding to the pos- 
itive eigenvalue Tj of p\ (cf (9a)). (In the necessity 
part of the proof, U a was not used; it was used only in 
the sufficiency part.) 

These unitaries (Zurek's envariance unitaries) are uti- 
lized to establish what are local or first-subsystem prop- 
erties, in particular, local probabilities. It immediately 
follows that any two distinct eigen-vectors correspond- 
ing to the same eigenvalue of p\ determine equal- 
probability events (cf Stage one in section III). Thus, 
envariance is made use of in the first and most important 
step of Zurek's argument in a completely assumption-of- 
probability-free way. 

Nevertheless, twin unitaries (envariance) is due to en- 
tanglement, and Zureks argument is based on the latter. 
Entanglement is, as well known, the basic staff of which 
quantum communication and quantum computation are 
made of. No wonder that entanglement is increasingly 
considered to be a fundamental physical entity. As an il- 
lustration for this, one may mention that preservation of 
entanglement has been proposed as an equivalent second 
law of thermodynamics for composite systems (cf Ref. 
[29[ | and the references therein). 

E. cr-additivity 

To get an idea how "heavy" the cr-additivity assump- 
tion for probability intuitively is, we put it in the form of 
a "staircase" of gradually strengthened partial assump- 
tions. 

The starting point is the fact is that if any event F 
occurs, the opposite event F 1 - ^ = (1 — F)j does 
not occur (in suitable measurement, of course). 
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1) It is plausible to assume that F + F = 1 has 
p(F) + p(F ) = 1 as its consequence in any quantum 
state. 

2) If E + F = G (all being events, i. e., projectors, 
and EF = ), then, in view of the fact that, e. g., 
F is the opposite event of E in G, i. e., F = 
E^G, and in view of assumption (1), it is plausible to 
assume that E + F — G implies p(E) + p(F) = p(G) 
in any quantum state. Obviously, assumption (2) is a 
strengthening of assumption (1). 

Lemma. Assumption (2) implies additivity for ev- 
ery finite orthogonal sum of events: Y2i Ei = G => 
YliP(Ei) = P(G) m an y quantum state. 

Proof. If the lemma is valid for n terms, then 

(n+l) n 

p{ E E ) + = 

i=l i=l 

n (n+l) 

P{H E ) +P(E(n + l))= £ K-^) S 
i=l i=l 

i. e., it is valid also for (n + 1) terms. By assumption, 
it is valid for two terms. By total induction, it is then 
valid for every finite sum. □ 

3) If G = liixin^oo F n and the sequence {F n : n = 
1,2,..., oo} is non-descending ( Vn : i^n+i) > F n <^> 
F( n +i)F n = F n ), then the assumption of continuity in 
the probability p(G) — lim„_ +00 p(F n ) is plausible (oth- 
erwise one could have jumps in probability and no event 
responsible for it). Assuming the validity of assumption 
(2), it implies 

oo n 

p{Y^E i )=p{ lim y>o = 

z — J 71— »-00 L * 



hm Y^p[E i )=Y^p{E i ), 

n— >oo * — » ^ — * 

i=l i=l 

i. e., cr-additivity ensues. 

If one wants to estimate how "steep" each of these 
"stairs" is, one is on intuitive ground burdened with feel- 
ing and arbitrariness. Assumption (1) seems to be the 
largest "step" (with respect to the stated fact that is its 
premise). Once (1) is given, assumption (2) (equivalent 
to additivity of probability) seems very natural, hence 
less "steep". The final assumption (3) seems even more 
natural, and hence least "steep". 

At one place Zurek admits that (1) is an assumption (cf 
the last-but-two quotation in the subsection on Zurek's 
article). One wonders if he can avoid to assume (2). 
Leaning on "the standard approach of Laplace" [3(| (sec- 
ond passage, right column, p. 18, 16]), in which "by 
definition" "the probability of a composite event is a ra- 
tio of the number of favorable equiprobable events to the 



total", property (2) of probability follows. Zurek seems 
to adopt this reasoning to a large extent within eigen- 
subspaces 1Z(Q{) of p\ (cf (7c) in this article). Thus, 
partially he can avoid to assume (2). But can he do this 
generally? 

The form ( </> | i pi | </> ) i of the probability rule 
achieved, following Zurek, in the present version (shortly, 
the present form), is equivalent to the (much more gen- 
erally looking) trace rule precisely on account of cr- 
additivity Taking an infinitely composite event E = 
Si=i I * ) ( * 1 1 cr-additivity allows to transform the 
present form into the trace rule: 

oo oo 

p(E) =J2(i\p\i) - E tr (<° I^D = tr ^- 

i=l i=l 

Thus, without cr-additivity the present form is not the 
standard probability rule. 

Besides, the argument just presented can appear in 
the very context of Zurek's argument. Let | be 
infinitely entangled, or, equivalently, let p± have an 
infinitely dimensional range. Further, let the above set 
{| i)\ : i = 1, 2, . . . , oo} (with index) be a set of eigen- 
vectors of pi (corresponding to different eigenvalues), 
but let they not span the whole range TZ(pi). With- 
out the validity of cr-additivity the present rule does 
not give an answer what is the probability p(E±,pi), 
where E\ = YmZi I 0i{* li ■ Thus, if one want the 
general form of the probability rule, and in the present 
version nothing less is wanted, then one must assume (2) 
and the continuity in (3). 

F. Zurek's argument and Gleason's theorem 

In an effort to tighten up Zurek's argument, his "small 
natural" and some tacit assumptions have been avoided 
as much as possible. The most disquieting consequence 
was raising cr-additivity to the status of a stipulation. 
This was no different than in Gleason's well known the- 
orem [3l[, which goes as follows. 

One assumes that one has a map associating a num- 
ber p from the doubly-closed interval [0, 1] with ev- 
ery subspace, or, equivalently, with every projector F 
(projecting onto a subspace) observing cr-additivity, i. 
e. 

pG>>) = Ep( f *) ( 24fl ) 

i i 

for every orthogonal decomposition (finite or infinite) of 
every projector. Then, for every such map, there exists 
a unique density operator p such that 

p{F) = tr(Fp) (246) 

for every projector (the trace rule). Thus, the set of all 
density operators and that of all quantum probabilities 
stand in a natural one-to-one relation. 
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Logically, this makes the other five stipulations (be- 
sides cr-additivity) in the present version of Zurek's ar- 
gument unnecessary. Barnum is on to this (see the above 
fourth quotation from his article), but his understand- 
ing seems to be that Zurek's assumption of additivity 
is weaker than that of Glcason. At least in the present 
version this is not so. 

Let us be reminded that in Stage one of section III ad- 
ditivity had to be used in concluding that if pi \<f>)i = 
Tj \4>)i, and the corresponding eigen-projector is Qj, 
projecting onto a dj-dimensional subspace (which is 
necessarily finite) , then the probability of \ 4>)i is 
PiQti/dj. 

Further, cr-additivity had to be used in Stage two to 
conclude that p(Q{) — rjdj, where also the fourth pos- 
tulate about zero probabilities from the (possibly infinite 
dimensional) null space of p\ had to be utilized. ("Had 
to be" means, of course, that "the present author saw no 
other way" . ) 

Zurek's argument is very valuable though we have the 
theorem of Gleason. Perhaps a famous dictum of Wigner 
can help to make this clear. When faced with the chal- 
lenge of computer simulations to replace analytical solu- 
tions of intricate equations of important physical mean- 
ing, Wigner has allegedly said "I am glad that your com- 
puter understands the solutions; but I also would like to 
understand them." 

Schlosshauer and Fine say (in the Introduction to their 
paper): 

" ...Gleason's theorem is usually considered as 
giving rather little physical insight into the 
emergence of quantum probabilities and the 
Born rule." 

As to the logical necessity of " the emergence of quan- 
tum probabilities" , it seems hopeless (unless if the prob- 
abilities would prove subjective, i. e., due to ignorance, 
like in classical physics, after all). Neither Gleason, nor 
Zurek, nor anybody else - as it seems to me - can de- 
rive objective quantum probability, in the sense to show 
that it necessarily follows from deterministic quantum 
mechanics. But, once one realizes from physical consid- 
erations that probability must exist, then one makes the 
logical assumption that it exists, and then one wonders 
what its form is. 

Gleason gives the complete answer at once in the form 
of the trace rule. One can then derive from it the other 
five postulates of the present version and more. To 
use Wigner's words, the mathematics in the proof of 
Gleason's theorem " understands" the uniqueness and the 
other wonders of the quantum probability rule, but we 
do not. 

Now, the extra 5 stipulations in the present version 
(besides cr-additivity), though logically unnecessary 
in view of Gleason's theorem, nevertheless, thanks to 
Zurek's ingenuity, help to unfold before our eyes the 
simplicity and full generality of the quantum rule in the 



form (0 1 p | <fi) (equivalent to the trace rule) . 



G. Why unitary operators? 

Both envariance and its other face, unitary twins, are 
expressed in terms of unitary operators. One can raise 
the question in the title of the subsection. 

The answer lies in the notion of distant influence. One 
assumes that the nearby subsystem 1 is dynamically 
decoupled from another subsystem 2, but not statis- 
tically. Quantum correlations are assumed to exist be- 
tween the two subsystems. On account of these correla- 
tions one can manipulate subsystem 2 in order to make 
changes in subsystem 1 (without interaction with it). 
By definition, local are those properties of the nearby 
subsystem that cannot be changed by the described dis- 
tant influence. Probabilities of events on subsystem 1 
were stipulated to be local. 

One is thinking in terms of so-called bare quantum 
mechanics, i. e., quantum mechanics without collapse. 
Then all conceivable manipulations of the distant subsys- 
tem are unitary evolutions (suitable interactions of suit- 
ably chosen subsystems - all without any interaction with 
subsystem 1 ). As Zurek puts it in his Fact 1 (sixth 
quotation in subsection IV. C): "Unitary transformations 
must act on the system to alter its state." (This goes 
for the distant subsystem which should exert the distant 
influence.) 

Unitary evolution preserves the total probability of 
events. The suspicion has been voiced that the restric- 
tion to unitary operators might just be a case of "putting 
in probability in order to get out probability" 32j . Even 
if this is so, it appears to be even milder than Zurek's 
" putting in" probability-one assumptions (cf last passage 
in subsection B.l in |l6|). 

One may try to argue that the unitarity of the evo- 
lution operator (of the dynamical law) does not contain 
any probability assumption. Namely, one may start with 
the Schrodinger equation, of which the unitary evolu- 
tion operator is the integrated form (from instantaneous 
tendency of change in a finite interval). At first glance, 
the Schrodinger equation has nothing to do with prob- 
abilities. But this is not quite so. The dynamical law, 
instantaneous or for a finite interval, gives the change of 
the quantum state, which is, in turn, equivalent to the 
totality of probability predictions. 

Perhaps one should not expect to derive probabilities 
exclusively from other notions (cf the second quotation 
from Ref. 2 in subsection IV. A). 

APPENDIX A 

We prove now that the correlation operator U a is 
independent of the choice of the eigen-sub-basis of p\ 
(cf (5a)) that spans TZ(pi) in which the strong Schmidt 
decomposition of \^}i2 (cf (3c)) is written. 

Let {| j,k J ) 1 : Vfc^Vj} and {| j, : VI,-, Vj} be 
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two arbitrary eigen-sub-bases of p\ spanning TZ(pi). 
The vectors are written with two indices, j denoting 
the eigen-subspace 1Z(Q{) to which the vector belongs, 
and the other index kj ( lj ) enumerates the vectors 
within the subspace. 

A proof goes as follows. Let 

h 

where (Uk\^j are unitary sub-matrices. Then, keep- 
ing U a one and the same, we can start out with the 
strong Schmidt decomposition in the fcj-eigen-sub-basis, 
and after a few simple steps (utilizing the antilinearity of 
U a and the unitarity of the transition sub-matrices), we 
end up with the strong Schmidt decomposition (of the 
same |^)i2 ) in the Zj-eigen-sub-basis: 



i*>«=EE r i /2 iJ'*i)i(^iJ' fc ^ 

3 kj 



u, 



°(E< <; !«>>)]J 



i h V, 



(li,l2) are the unit element. Then the inverse of 
(J7i,[/ 2 ) has to be (U^ 1 ,U 2 ~ 1 ) , and it is the inverse 
from left and from right of the former, and it is the unique 
inverse as in a group it should be. But it is not obvious 
that (U^ , U^ 1 ) are twin unitaries. 

It is well known (and easy to see) that the set of all 
(bipartite) unitaries U12 that leave the given state 

4 r )i2 unchanged is a subgroup of all unitaries, the 
so-called invariance group of the vector. If {1/1,1/2) 
arc twin unitaries, then UiU^ 1 leaves \& )i2 
unchanged or envariant (cf (8a) and (8b)). Its inverse 
is (U1U2 1 )- 1 = Ui 1 (Uz 1 )- 1 . Then (fTf 1 , U^ 1 ) are 
twin observables. □ 

APPENDIX C 

Those linear operators A in a complex separa- 
ble Hilbert space are Hilbert-Schmidt ones for which 
tx(A^A) < 00 ( A* being the adjoint of A ). The 
scalar product in the Hilbert space of all linear Hilbert- 
Schmidt operators is ^A, B^j = tr(A^fl) (cf the Defini- 
tion after Theorem VI. 21 and problem VI. 48(a) in (l7j). 

The statement that p n converges to p in the topol- 
ogy determined by the distance in the Hilbert space of 
all linear Hilbert-Schmidt (HS) operators means: 

lim \\p - PuWhs = nm tr(p ~ Pn) 2 = 



lim y>fc| (p- Pn ) 2 \<P k ) = 0, 
k 

where {\4>k) ■ Vfc} is an arbitrary basis. 

On the other hand, the claim that p n converges to 
p in the strong operator topology means [l7] that 



V |V> : lim \\p\^)-Pn 



lim (ip\ (p-p n ) 2 =0. 



□ 



APPENDIX B 

We elaborate now the group of pairs of unitary twins. 

Let ({/{jC/g) and (1/1,1/2) be two pairs of twin uni- 
taries for a given bipartite state vector |^)i2, i- e., let 
U[ |*)i2 = U 2 |*}i2, and f/i I = U 2 I *>ia, be 
valid. Then, applying U2 to both sides of the former 
relation, exchanging the rhs and the lhs, and utilizing the 
latter relation, one has: 

U 2 U 2 |*)i 2 = U 2 U{ |*)l 2 = U[U 2 |*>12 = C/i'C/l |*)l2- 

Hence, (U[Ui, U2U2) are twin unitaries, and one can 
define a composition law as (^1,^2) x (Ui,U 2 ) = 
(U'-J/i, 1/21/2)- Naturally, the trivial twin unitaries 



Thus, the latter topology requires convergence to zero 
only for each vector separately (without any uniformity 
of convergence for some subset), whereas the former 
topology requires the same uniformly for any basis, 
moreover for their sum (which may be infinite). The 
former topology requires much more, and hence it is 
stronger. 
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Since I have profited immensely from the ideas of all the upshot of a collective effort. But for all its short- 
other participants in the "Born's rule from envariance" comings and possible failures I am the only one to blame, 
enterprise, the present version is, to a certain extent, 
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